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ANALYSIS OF SPDES ARISING IN PATH SAMPLING 
PART II: THE NONLINEAR CASE 

By M. Hairer 1 , A. M. Stuart 1,2 and J. Voss 2 

University of Warwick 

In many applications, it is important to be able to sample paths of 
SDEs conditional on observations of various kinds. This paper stud- 
ies SPDEs which solve such sampling problems. The SPDE may be 
viewed as an infinite-dimensional analogue of the Langevin equation 
used in finite-dimensional sampling. In this paper, conditioned non- 
linear SDEs, leading to nonlinear SPDEs for the sampling, are stud- 
ied. In addition, a class of preconditioned SPDEs is studied, found 
by applying a Green's operator to the SPDE in such a way that 
the invariant measure remains unchanged; such infinite dimensional 
evolution equations are important for the development of practical 
algorithms for sampling infinite dimensional problems. 

The resulting SPDEs provide several significant challenges in the 
theory of SPDEs. The two primary ones are the presence of nonlinear 
boundary conditions, involving first order derivatives, and a loss of 
the smoothing property in the case of the pre-conditioned SPDEs. 
These challenges are overcome and a theory of existence, uniqueness 
and ergodicity is developed in sufficient generality to subsume the 
sampling problems of interest to us. The Gaussian theory developed 
in Part I of this paper considers Gaussian SDEs, leading to linear 
Gaussian SPDEs for sampling. This Gaussian theory is used as the 
basis for deriving nonlinear SPDEs which affect the desired sampling 
in the nonlinear case, via a change of measure. 

1. Introduction. The purpose of this paper is to provide rigorous jus- 
tification for a recently introduced stochastic partial differential equation 
(SPDE)-based approach to infinite dimensional sampling problems [14, 22]. 
The methodology has been developed to solve a number of sampling prob- 
lems arising from stochastic differential equations (SDEs — assumed to be 
finite-dimensional unless stated otherwise), conditional on observations. 
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The setup is as follows. Consider the SDE 

(1.1) dX = AXdu + f(X)du + BdW x , X(0)=x~, 

where f(x) = -BB*W(x), V: R d — > R, Be R dxd is invertible and W x is 
a standard d-dimensional Brownian motion. We consider three sampling 
problems associated with (1.1): 

1. free path sampling, to sample paths of (1.1) unconditionally; 

2. bridge path sampling, to sample paths of (1.1) conditional on knowing 
X(l) = x+; 

3. nonlinear filter/ smoother, to sample paths of (1.1), conditional on 
knowledge of (V(w)) u e[o,i] solving 

(1.2) dY = AXdt + BdW y , Y{0) = 0, 

where A G R mxd is arbitrary, B G K mxm is invertible and W y is a standard 
m-dimensional Brownian motion. 

The methodology proposed in [22] is to extend the finite dimensional 
Langevin sampling technique [21] to infinite-dimensional problems such as 
those listed above as 1 to 3. This leads to SPDEs which are ergodic and 
have stationary measure which solves the desired sampling problem. 

We believe that an infinite dimensional sampling technique can be derived 
by taking the (formal) density of the target distribution and mimicking the 
procedure from the finite dimension Langevin method. In this paper, we 
provide a rigorous justification for this claim in the case of equation (1.1), 
where the drift is linear plus a gradient, the noise is additive and, in case 3, 
observations arise linearly, as in (1.2). A conjecture for the case of general 
drift, and for a nonlinear observation equation in place of (1.2), is described 
in Section 9 at the end of the paper. 

For the problems considered here, the resulting SDPEs are of the form 

(1.3) dx = (BB*y 1 dlxdt-V<Z>(x)dt + V2dw(t), 

and generalizations, where w is a cylindrical Wiener process (so that ^ 
is space-time white noise) and $ is some real- valued function on R d . [Note 
that the "potential" $ is different from V; see (5.3) below.] For problem 
1, the resulting SPDE is not a useful algorithmic framework in practice as 
it is straightforward to generate unconditioned, independent samples from 
1 by application of numerical methods for SDE initial value problems [15]; 
the Langevin method generates correlated samples and, hence, has larger 
variance in any resulting estimators. However, we include analysis of problem 
1 because it contributes to the understanding of subsequent SPDE-based 
approaches. For problems 2 and 3, we believe that the proposed methodology 
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is, potentially, the basis for efficient Markov chain Monte Carlo (MCMC)- 
based sampling techniques. Some results about how such an MCMC method 
could be implemented in practice can be found in [1] and [13]. 

The resulting MCMC method, when applied to problem 3, results in a new 
method for solving nonlinear filtering/smoothing problems. This method 
differs substantially from traditional methods like particle filters which are 
based on the Zakai equation. While the latter equation describes the density 
of the conditional distribution of the signal at fixed times t, our proposed 
method samples full paths from the conditional distribution; statistical quan- 
tities can then be obtained by considering ergodic averages. Consequently, 
while the proposed method cannot easily be applied in online situations, it 
provides dynamic information on the paths, which cannot be so easily read 
off the solutions of the Zakai equation. Another difference is that the inde- 
pendent variables in the Zakai equation are in whereas equation (1.3) 
is always indexed by [0,oo) x [0,1] and only takes values in M. d . Thus, the 
proposed method should be advantageous in high dimensions. For further 
discussion and applications, see [13]. 

In making such methods as efficient as possible, we are lifting ideas from 
finite-dimensional Langevin sampling into our infinite-dimensional situation. 
One such method is to use preconditioning, which changes the evolution 
equation, whilst preserving the stationary measure, in an attempt to roughly 
equalize relaxation rates in all modes. This leads to SPDEs of the form 

(1.4) dx = G{BB*)- l dlxdt - gV$(x) dt + V2G 1/2 dw(t), 

and generalizations, where w, again, is a cylindrical Wiener process. In the 
finite-dimensional case, it is well known that the invariant measure for (1.4) 
is the same as for (1.3). In this paper, we will study the methods proposed in 
[1] which precondition the resulting infinite-dimensional evolution equation 
(1.3) by choosing Q as a Green's operator. We show that equation (1.4), in 
its stationary measure, still samples from the desired distribution. 

For both preconditioned and unpreconditioned equations, the analysis 
leads to mathematical challenges. First, when we are not conditioning on 
the endpoint (in problems 1 and 3), we obtain an SPDE with a nonlinear 
boundary condition of the form 

d u x(t,l) = f(x(t,l)) Vte(0,oo), 

where / is the drift of the SDE (1.1). In the abstract formulation using 
Hilbert-space-valued equations, this translates into an additional drift term 
of the form f(x(t, l))#i, where 5± is the delta distribution at u = l. This 
forces us to consider equations with values in the Banach space of continuous 
functions (so that we can evaluate the solution x at the point u = 1) and 
to allow the drift to take distributions as values. Unfortunately, the theory 
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for this situation is not well developed in the literature. Therefore, we here 
provide proofs for the existence and uniqueness of solutions for the SPDEs 
considered. This machinery is not required for problem 2, as the Hilbert 
space setting [4, 5, 6, 25] can be used there. 

We also prove ergodicity of these SPDEs. Here, a second challenge comes 
from the fact that we consider the preconditioned equation (1.4). Since we 
want to precondition with operators Q which are close to (<9„) _1 , it is not 
possible to use smoothing properties of the heat semigroup anymore and 
the resulting process no longer has the strong Feller property. Instead, we 
show that the process has the recently introduced asymptotic strong Feller 
property (see [12]) and use this to show existence of a unique stationary 
measure for the preconditioned case. 

The paper is split into two parts. The first part, consisting of Sections 2, 3 
and 4, introduces the general framework, while the second part, starting at 
Section 5, uses this framework to solve the three sampling problems stated 
above. Readers only interested in the applications can safely skip the first 
part on first reading. The topics presented there are mainly required to 
understand the proofs in the second part. 

The two parts are organized as follows. In Section 2, we introduce the 
technical framework required to give sense to equations like (1.3) and (1.4) 
as Hilbert-space-valued SDEs. The main results of this section are Theorem 
3.4 and 3.6, showing the global existence of solutions of these SDEs. In 
Section 3, we identify a stationary distribution of these equations. This result 
is a generalization of a result by Zabczyk [25]; the generalization allows 
us to consider the Banach-space-valued setting required for the nonlinear 
boundary conditions and is also extended to consider the preconditioned 
SPDEs. In Section 4, we show that the stationary distribution is unique 
and that the considered equations are ergodic (see Theorems 4.10 and 4.11). 
This justifies their use as the basis for an MCMC method. 

In the second part of the paper, we apply the abstract theory to derive 
SPDEs which sample conditioned diffusions. Section 5 outlines the method- 
ology. Then, in Sections 6, 7 and 8, we discuss the sampling problems 1, 2 
and 3, respectively, proving the desired property for both the SPDEs pro- 
posed in [22] and the preconditioned method proposed in [1]. In the case 2, 
bridges, the SPDE whose invariant measure is the bridge measure was also 
derived in one dimension in [20]. In Section 9, we give a heuristic method to 
derive SPDEs for sampling, which applies in greater generality than the spe- 
cific setups considered here. Specifically, we show how to derive the SPDE 
when the drift vector field in (1.1) is not of the form "linear plus gradient"; 
for signal processing, we show how to extend beyond the case of linear ob- 
servation equation (1.2). This section will be of particular interest to the 
reader concerned with applying the technique for sampling which we study 
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here. The gap between what we conjecture to be the correct SPDEs for sam- 
pling in general and the cases considered here points to a variety of open 
and interesting questions in stochastic analysis; we highlight these. 

To avoid confusion, we use the following naming convention. Solutions to 
SDEs like (1.1), which give our target distributions, are denoted by upper 
case letters like X. Solutions to infinite-dimensional Langevin equations like 
(1.3), which we use to sample from these target distributions, are denoted 
by lower case letters like x. The variable which is time in equation (1.1) 
and space in (1.3) is denoted by u and the time direction of our infinite- 
dimensional equations, which indexes our samples, is denoted by t. 

2. The abstract framework. In this section, we introduce the abstract 
setting for our Langevin equations, proving existence and uniqueness of 
global solutions. We treat the nonpreconditioned equation (1.3) and the pre- 
conditioned equation (1.4) separately. The two main results are Theorems 
2.6 and 2.10. Both cases will be described by stochastic evolution equations 
taking values in a real Banach space E continuously embedded into a real 
separable Hilbert space TL. In our applications in the later sections, the space 
TL will always be the space of L 2 functions from [0, 1] to M. d and E will be 
some subspace of the space of continuous functions. 

Our application requires the drift to be a map from E to E* . This is 
different from the standard setup as found in, for example, [5], where the 
drift is assumed to take values in the Hilbert space TL. 

2.1. The nonpreconditioned case. In this subsection, we consider semi- 
linear SPDEs of the form 

(2.1) dx = Cxdt + F(x)dt + V2dw(t), x(0)=x , 

where £ is a linear operator on TL, the drift F maps E into E* , w is a 
cylindrical Wiener process on TL and the process x takes values in E. We 
seek a mild solution of this equation, defined precisely below. 

Recall that a closed, densely defined operator £ on a Hilbert space TL is 
called strictly dissipative if there exists c > such that (x,Cx) < —c\\x\\ 2 for 
every x G T>(C). We make the following assumptions on C 

(Al) Let C be a self-adjoint, strictly dissipative operator on TL which gen- 
erates an analytic semigroup Sit). Assume that S(t) can be restricted 
to a Co-semigroup of contraction operators on E. 

Since — C is self-adjoint and positive, one can define arbitrary powers of 
— C. For a > 0, let TL a denote the domain of the operator (— C) a endowed 
with the inner product (x,y) a = ((—C) a x, {—C) a y). We further define TL~ a 
as the dual of TL a with respect to the inner production TL (so that TL can be 
seen as a subspace of TL~ a ). Denote the Gaussian measure with mean f^GTL 
and covariance operator C on TL by ftf(fi,C). 
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(A2) There exists an a G (0, 1/2) such that TL a C E (densely), {-C)~ 2a is 
nuclear in TC and the Gaussian measure J\f(0, (— C)~ 2a ) is concentrated 
on E. 

This condition implies that the stationary distribution AA(0, (— C)~ l ) of 
the linear equation 

(2.2) dz = Czdt + V2dw(t) 

is concentrated on E. 

Under assumption (A2), we have the following chain of inclusions: 

n l/2 ^ n a^ E ^ n ^ E *^ n -a ^ U~ 1/2 . 

Since we assumed that E is continuously embedded into TC, each of the 
corresponding inclusion maps is bounded and continuous. Therefore, we can, 
for example, find a constant c with ||x||.e* < c||x||£ for all x E E. Later, we 
will use the fact that, in this situation, there exist constants c\ and c 2 with 

(2.3) \\S(t)\\E^E < ^WSWWn-^n* < c 2 r 2a . 

We begin our study of equation (2.1) with the following, preliminary result 
which shows that the linear equation takes values in E. 

Lemma 2.1. Assume (Al) and (A2) and define the Ti. -valued process z 
by the stochastic convolution 

r t 

(2.4) z(t) = V2 S(t-s)dw(s) Vt>0, 

Jo 

where w is a cylindrical Wiener process on Ti. Then, z has an E-valued con- 
tinuous version. Furthermore, its sample paths are almost surely /3-Holder 
continuous for every /3 < 1/2 — a. In particular, for such (3, there exist con- 
stants C Pt/ 3 such that 

(2.5) Esap\\z(8)\\ p E <C PiP tPP 

s<t 

for every t < 1 and every p>l. 

Proof. Let i be the inclusion map from 7i a into E and j be the inclu- 
sion map from E into TC. Since (x,y) a = ((—C) 2a jix,jiy) for every x,y in 
7i a , one has 

(x,jiy) = (i*j*x,y) a = ((-C) 2a jii*j*x,jiy) 

for every x £ Ti and every y G Ti a . Since Ti a is dense in TC, this implies that 
jii*j* = (— C)~ 2a . Thus, (A2) implies that ii* is the covariance of a Gaussian 
measure on E, which is sometimes expressed by saying that the map i is 
^y-radonifying. 
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The first part of the result then follows directly from [3], Theorem 6.1. 
Conditions (i) and (ii) there are direct consequences of our assumptions (Al) 
and (A2). Condition (iii) there states that the reproducing kernel Hilbert 
space Tit associated with the bilinear form (x,C~ 1 (e ct — l)y) has the prop- 
erty that the inclusion map Tit^Eis 7-radonifying. Since we assumed that 
C is strictly dissipative, it follows that Ht = 'H 1 ^ 2 ■ Since we just checked that 
the inclusion map from H l/2 into E is 7-radonifying, the required conditions 
hold. 

If we can show that E||z(i + h) — z{t)\\ E < C\h\ l l 2 ~ a for some constant C 
and for h £ [0, 1], then the second part of the result follows from Fernique's 
theorem [10] combined with Kolmogorov's continuity criterion [19], Theorem 
2.1. One has 

E\\z(t + h)- z{t)\\ E <K\\S(h)z(t)-z{t)\\ E + V2E / S(s)dw(s) 

Jo 

The random variable z(t) is Gaussian on Ti with covariance given by 

Q t = {-Cy\l-S{2t)). 
This shows that the covariance of (S(h) — I)z(t) is given by 

(S(h) - I)Q t (S(h) -I) = {-C)- a A a (I - S(2t))A a (-C)- a , 

with 

A a =(-cr-v 2 (s(h)-i). 

Since (A2) implies that (— C)~ a is 7-radonifying from H to E and (S(2t) — I) 
is bounded by 2 as an operator from TC to 7Y, we have 

T^CWAaU^KClhl 1 / 2 -*, 

where the last inequality follows from the fact that C is self-adjoint and 
strictly dissipative. The bound on T2 can be obtained in a similar way. 
From Kolmogorov's continuity criterion, we get that z has a modification 
which is /3-H61der continuous for every (3 < 1/2 — a. 

Since we now know that z is Holder continuous, the expression 

\\z(s) - z(t)\\ E 

(2.6) sup -g 

s,te[o,i] |t — s \ 

is finite almost surely. Since the field |f^pp is Gaussian, it then follows 
from Fernique's theorem that (2.6) also has moments of every order. □ 

Remark 2.2. The standard factorization technique ([5], Theorem 5.9) 
does not apply in this situation since, in general, there exists no interpolation 
space Hfi such that HP C E and z takes values in "HP : for HP C E, one would 
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require (5 > 1/4, but the process takes values in 'HP only for (3 < 1/4. Lemma 
2.1 should rather be considered as a slight generalization of [5], Theorem 
5.20. 

Definition 2.3. The subdifferential of the norm || • \\e at x G E is de- 
fined as 

<9||x||.E = {x* G E*\x*(x) = \\x\\e and x*(y) < \\uWe Vy G E}. 

This definition is equivalent to the one in [5], Appendix D and, by the 
Hahn-Banach theorem, the set is nonempty. We use the subdiffer- 

ential of the norm to formulate the conditions on the nonlinearity F. Here 
and below, C and N denote arbitrary positive constants that may change 
from one equation to the next. 

(A3) The nonlinearity F:E^E* is Frechet differentiable with 

\\F{x)\\e* <C(1 + \\x\\ e ) n and \\DF(x)\\ e ^e* < C(l + \\x\\ E ) N 
for every x G E. 

(A4) There exists a sequence of Frechet differentiable functions F n :E^E 
such that 

lim ||F n (x)-F(x)||_ Q = 

n — >oo 

for all x £ E. For every C > 0, there exists a K > such that for 
all x G E with \\x\\e < C and all n G N, we have ||-F n (x)||_Q, < K. 
Furthermore, there is a 7 > such that the dissipativity bound 

(2-7) (x*,F n (x + y))< -7NU 

holds for every x* G <9||a;||.E and every x,y G E with ||a;||_E > C(l + 

llvlb)"- 

As in [5], Example D.3, one can check that in the case E = C([0, 1],R ), 
the elements of (9||x||_e can be characterized as follows: x* G <9||:e||£; if and 
only if there exists a probability measure on [0, 1] with supp|x*| C {u G 
[0,l]||x(n)| = Hxlloo} and such that 

(2.8) x*(y) = |^(u),|^^|x*|(du) 

for every y G E. Loosely speaking, the dissipativity condition in (A4) then 
states that the drift F n points inward for all locations u G [0, 1], where |x(u)| 
is largest and thus acts to decrease 
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Definition 2.4. An ^-valued and (.T-^-adapted process x is called a 
mild solution of equation (2.1) if almost surely 

(2.9) x(t) = S(t)x + f S(t- s)F(x(s))ds + z{t) V* > 



o 

holds, where z is the solution of the linear equation from (2.4). 

Lemma 2.5. Let C satisfy assumptions (Al) and (A2). Let F:E^> 
E* be Lipschitz continuous on bounded sets, tp : M+ — > E be a continuous 
function and xo £ TC 1 ^ 2 . Then, the equation 

dx 

(2.10) —(t) = Cx{t)+F{x(t) + iP(t)), x{0) = x 

has a unique, local, Ti 1 ' -valued mild solution. 

Furthermore, the length of the solution interval is bounded from below 
uniformly in ||a?o [| 1/2 + su Pte[0,i] lhK*)lb- 

Proof. Since ip is continuous, ||V>(t)||.E is locally bounded. It is a straight- 
forward exercise using (2.3) to show that, for sufficiently small T, the map 
M T acting from C([0, T], H 1/2 ) into itself and defined by 

(M T y)(t) = S(t)x + / S(t - s)F(y(s) + ^(s)) ds 





is a contraction on a small ball around the element t S(t)xQ. Therefore, 
(2.10) has a unique local solution in 7i 1 / 2 . The claim on the possible choice 
of T can be checked in an equally straightforward way. □ 

Theorem 2.6. Let C and F satisfy assumptions (A1)-(A4). Then, for 
every xq E E, the equation (2.1) has a global, E -valued, unique mild solution 
and there exist positive constants K p and a such that 

E||x(t)||^<e-^||xo||^ + K p 

for all times. 

Proof. Let z be the solution of the linear equation dz = Cz{t)dt + 
\f2dw and, for n S N, let y n be the solution of 

%(t) = Cy n {t) + F n (y n (t) + z(t)), y„(0) = x , 
dt 

where F n is the approximation of F from (A4). From Lemmas 2.1 and 2.5, 
we get that the differential equation almost surely has a local mild solution. 
We begin the proof by showing that y n can be extended to a global solution 
and obtaining an a priori bound for y n which does not depend on n. 
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Let t > be sufficiently small that y n (t + h) exists for some h > 0. As an 
abbreviation, define f(s) = F n {y n (s) + z(s)) for all s < t + h. We then have 



\y(t + h)h 



S(h)y(t) + J* +k S(t + h- s)f(s) ds 



Since / is continuous and the semigroup S is a strongly continuous contrac- 
tion semigroup on E, we obtain 



E 



< 



< 



\\S(t + h- s)(f(s) - f(t))\\ E + W(S(t + h-s)- S(h))f(t)\\ E ds 



t+h 

S{t + h- s)f(s) ds - hS(h)f{t) 

t+h 

■+h rh 

\\f(s)-f(t)\\ E ds+ / \\(S(r)-S(0))f(t)\\ E dr 
Jo 

= o(h) 
and thus 

\\y(t + h)\\ E = \\S(h)y(t) + S(h)hf{t)\\ E + o{h) < \\y(t) + hf(t)\\ E + o(h) 
as h I 0. This gives 

y(t + h)\\ E - \\y(t)\\ E 



lim sup ■ 

hlO 



< lim 

hlO 



h 

\y(t) + hf(t)\\ E -\\y 

h 



m a x{(y*J(t))\y*€d\\y(t)\\ E }, 



where the last equation comes from [5], equation (D.2). Using assumption 
(A4), we get 

r \\yn(t + h)\\ E -\\y n (t)\\ E 

hmsup < -i\\y n {t)\\E 

HO ri 

for all t > with ||y n (*)lb > c i l + \W)\\e) N ■ 

An elementary proof shows that any continuous function f:[0,T] — > 1R 
with f(t) > /(0) exp(-7t) for a t G (0, T] satisfies limsup(/(s + /i) - f{s))/h > 
—7/(5) for some s G [0,t). Therefore, whenever ||y n (i)lls > C(l + H^WHe)"^ 
for all t G [a, b] , the solution y n decays exponentially on this interval with 

\\yn(t)\\E<\\yn(a)\\Ee-'< {t - a) 

for all t G [a,b]. Thus (see Figure 1), we find the a priori bound 
(2.11) \\y 



E < e~' (t \\x \\ V sup C7e^- s )(l + ||z( S )|| E ) 

0<s<t 



N 




for the solution y n . Using this bound and Lemma 2.5 repeatedly allows us 
to extend the solution y n to arbitrarily long time intervals. 
Lemma 2.5 also gives local existence for the solution y of 



(2.12) 



^(t) = Cy(t) + F(y(t) + z(t)), y(0) = x . 



Once we have seen that the bound (2.11) also holds for y, we obtain the 
required global existence for y. Let t be sufficiently small for y(t) to exist. 
Then, using (2.3), 



\yn(s)-y(s)\\ E <C 



S{s - r)(F n (y n + z) - F(y + z)) dr 



<C f\s- ry 2a \\F n {y n + z) - F{y + z)\\. a 
Jo 



dr 



for every s < t and thus 



/ \\y n -y\\Eds<C / {s-ry 2a \\F n (y n + z)-F(y n + z)\\_ a 
Jo Jo Jo 



dr ds 



+ C 



(s - r)- Za \\F(y n + z)- F(y + z)\\_ a drds 



to Jo 
= :C(h+I 2 ). 

The map F : E — > E* is Lipschitz on bounded sets and thus has the same 
property when considered as a map E — ► TL~ a . Using (2.11) to see that there 
is a ball in E which contains all y n , we get ||-F(y n + z) — F(y + z)||_ Q < 
C\\y n — y\\-a- Fubini's theorem then gives 



\Vn{r) -y{r) 



(s-r)~ 2a dsdr 
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< 



t 



\-1a 



1 -2a 



\Vn{r) -y(r)\\ E dr 



and, by choosing t sufficiently small and moving the /2-term to the left-hand 
side, we find 



\Vn(r) -y{r)\\Edr < CI\. 



- a in the integral is uniformly 
2a M is an integrable, uniform 



By (A4), the term \\F n (y n + z) - F(y n + z] 
bounded by some constant K and thus (s — r 
upper bound for the integrand. Again by (A4), the integrand converges to 
pointwise, so the dominated convergence theorem yields 



(2.13) 



\Vn(r) - y{r)\\E<lr < Cl\ 







as n — > 00. Assume (for the purposes of obtaining a contradiction) that y 
violates the bound (2.11) for some time s £ [0, t]. Since 1 1— > y(t) is continuous, 
the bound is violated for a time interval of positive length, so Jq ||yn(?") — 
y{r)\\E<lr is bounded from below uniformly in n. This contradicts (2.13), so 
y must satisfy the a priori estimate (2.11). Again, we can iterate this step 
and extend the solution y of (2.12) and thus the solution x = y + z of (2.1) 
to arbitrary large times. 

Now, all that remains, is to prove the given bound on E||a;(i)||^. For k £ N, 
let a k = swp k _ 1<t<k \\z(t)\\ E and 



£ fe = sup V2 

k+l<t<k+2 



S(s — k) dw(s) 



The are then identically distributed and, for \k — 1\ > 2, the random vari- 
ables and are independent. Without loss of generality, we can assume 
^(^xll^- < e~ te ||x||£; for some small value e > [otherwise, replace C with 
C — el and F with F + el, where e is chosen sufficiently small that (A4) 
still holds for F + el]. Thus, for h £ [1, 2], we get 



\z(k + h)\\ < \\S(h)z 



+ v / 2 



k+h 



S(s — k) dw(s] 



<e £ a k + Ck 



and, consequently, a k+ 2 < e~ £ a k + Ck- Since the and a±, ai have Gaussian 
tails, it is a straightforward calculation to check from this recursion relation 
that the expression J2k=i m e " l ^ m ~ k ^ a k nas bounded moments of all orders 
that are independent of m. Since the right-hand side of (2.11) is bounded 
by expressions of this type, the required bound on the solutions x(t) follows 
immediately, with a = 7 — e. □ 
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2.2. The preconditioned case. In this section, we consider semilinear 
SPDEs of the form 

(2.14) dx = g(Cx + F(x))dt + V2Q 1/2 dw(t), x(0)=x , 

where C, F and w are as before and Q is a self-adjoint, positive linear 
operator on Ti. We seek a strong solution of this equation, defined below. In 
order to simplify our notation, we define C = QC, F = QF and w = g 1 / 2 w . 
Then, w is a (/-Wiener process on Ti and equation (2.14) can be written as 

dx = Cxdt + F(x)dt + V2dw(t), x(0)=x . 

For the operator C, we will continue to use assumptions (Al) and (A2). 
For F, we use the growth condition (A3), but replace the dissipativity con- 
dition (A4) with the following one. 

(A5) There exists N > such that F satisfies 

{x,F(x + y))<C(l + \\y\\ E ) N 

for every x,y 6 E. 

Remark 2.7. Note that (A5) is structurally similar to assumption (A4) 
above, except that we now assume dissipativity in Ti rather than in E. 

We make the following assumption on Q: 

(A6) The operator Q : Ti — > Ti is trace class, self-adjoint and positive definite, 
the range of Q is dense in Ti and the Gaussian measure ftf(0,G) is 
concentrated on E. 

Define the space Ti to be T>{Q~ 1 ^ 2 ) with the inner product (x, y)^ = (x, G~ l y). 
We then assume that Q is equal to the inverse of C, up a "small" error in 
the following sense. 

(A7) We have QC = —I + KL, where K, is a bounded operator from Ti to 

Ti. 

Lemma 2.8. Assume (Al), (A2), (A6), (A7). Then, H = Ti 1/2 . In par- 
ticular, TicE. 

Proof. First, note that by [24], Theorem VII.1.3, the fact that QC is 
bounded on Ti implies that range(^) C "D(C). Furthermore, (A7) implies 
that T>(£) C Ti. For every x £ ranged, one has 

\\\x\\ 2 n -\\x\\ 2 1/2 \ = \(g- 1 / 2 x,g- 1 / 2 icx)\ 
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so the norms and ||a;||i/2 are equivalent. In particular, we have 

range(g)cV(£)cHcH 1/2 . 

The facts that range(<5) is dense in TC and T>(£) is dense in TC 1 ^ 2 conclude 
the proof. □ 

Definition 2.9. An ^-continuous and adapted process x is called a 
strong solution of (2.14) if it satisfies 

(2.15) x (t) = x + f {G£x(s) + gF(x(s)))ds + V2w(t) Vt > 

J o 

almost surely. 

Theorem 2.10. Let C, F and Q satisfy assumptions (A1)-(A3) and 
(A5)-(A7). Then, for every xq £ E, equation (2.14) has a global, E -valued, 
unique strong solution. There exists a constant N > and, for every p > 0, 
there exist constants K p , C p and 7 P > such that 

(2.16) n<t)f E < C p (l + llxolb)^"^ + K p 
for all times. 

Proof. Since it follows from (A6) and Kolmogorov's continuity crite- 
rion that the process w(t) is E- valued and has continuous sample paths, it 
is a straightforward exercise (use Picard iterations pathwise) to show that 
(2.14) has a unique strong solution lying in E for all times. It is possible 
to obtain uniform bounds on this solution in the following way. Choose an 
arbitrary initial condition xq G E and let y be the solution to the linear 
equation 

dy = —ydt + dw(t), y(0)=XQ. 
There exist constants K p such that 

(2.17) ny{t)\\ P E<e- pt \\x4 P E + K p - 

Denote by z the difference z(t) = x(t) — y(t). It then follows that z satisfies 
the ordinary differential equation 

dz 

— = Cz(t) + F(x(t)) + Ky(t), 2(0) = 0. 

Since C is bounded from Iri to Ti. by (A7) and F(x) + fCy £ Iri for every 
x,y £ E by Lemma 2.8, it follows that z{t) G TC for all times. Furthermore, 
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we have the following bound on its moments: 




< -2w||z||^ + (F(x),x-y)fi + {lCy,z)fi 

<C- 2w\\z\\ 2 li + |||^ + C(i + \\ y \\ E f + ^\\]Cyfn 
<-w\\z\\% + C{l + \\y\\ E ) N . 



Using Gronwall's lemma, it thus follows from (2.17) that x satisfies a bound 
of the type (2.16) for every p > 0. □ 

3. Stationary distributions of semilinear SPDEs. In this section, we give 
an explicit representation of the stationary distribution of (2.1) and (2.14) 
when F is a gradient, by comparing it to the stationary distribution of the 
linear equation 



The main results are stated in Theorems 3.4 and Theorem 3.6. 

The solution of (3.1) is the process z from Lemma 2.1 and its stationary 
distribution is the Gaussian measure v = M(0, — £ _1 ). In this section, we 
identify, under the assumptions of Section 2 and with F = U' for a Frechet 
differentiable function U : E — > M, the stationary distribution of the equa- 
tions (2.1) and (2.14). It transpires to be the measure fj, which has the 
Radon-Nikodym derivative 



with respect to the stationary distribution v of the linear equation, where 
c is the appropriate normalization constant. In the next section, we will see 
that there are no other stationary distributions. 

The results here are slight generalizations of the results in [25]. Our sit- 
uation differs from the one in [25] in that we allow the nonlinearity U' to 
take values in E* instead of Ti and that we consider preconditioning for the 
SPDE. We have scaled the noise by \/2 to simplify notation. Where possible, 
we refer to the proofs in [25] and describe in detail arguments which require 
nontrivial extensions of that paper. 

Let ( 

e n)neN) be an orthonormal set of eigenvectors of C in Ti. For n E N let 
E n be the subspace spanned by e±, . . . , e n and let II n be the orthogonal pro- 
jection onto E n . From [25], Proposition 2, we know that, under assumption 
(A2), we have E n C E for every n £ N. 



(3.1) 




£z{t) + V2^-{t) Vt>0, 



0. 



dfi = cexp(U) dv 
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Lemma 3.1. Suppose that assumptions (Al) and (A2) are satisfied. 
There then exist linear operators II n : E — > E n which are uniformly bounded 
in the operator norm on E and which satisfy II n n n = tl n and \\Tl n x — x\\e — ► 
as n — > oo . 

Proof. The semigroup S on Ti can be written as 

oo 

S(t)x = Y,e- tXk {e k ,x)e k 

k=l 

for all x G Ti and t > where the series converges in Since there is a 
constant ci > with ||x||t^ < ci||x||e and, from [25], Proposition 2, we know 
there exists a constant c 2 > with ||efc||_E < C2yf\k, we have 

||e~* Afe (e fe , x)e k \\ E < e~ tXk \\e k \\ H \\x\\ H \\e k \\ E < cic 2 e" tAfc VAfclMIc 

for every € N. Consequently, there is a constant C3 > with 

||e~ tAfc <e fc , x)e fc || s < c 3 * _3/2 A^ 1 jB . 

Now, define II n by 

n 

(3.2) fl n x = Y,e~ tnXk (e k ,x)e k , 

k=l 

where 

(00 
E >^ 
fe=n+l 

[This series converges, since assumption (A2) implies that C~ l is trace class.] 
Then, 

00 

\\(S(t n )-fl n )x\\ E <c 3 t- 3 / 2 X k 1 \\x\\ E = c 3 tf/ 2 \\x\\ E . 

k=n+l 

We have ||n n ||.E < ll^(*n) — n n ||£ + H^^n)!^- Since S is strongly continuous 
on E, the norms HS^tn)^ are uniformly bounded. Thus, the operators II n 
are uniformly bounded and, since t n — ► 0, we have ||II n x — x\\e < ||S'(i n )a; — 
n n x||B + \\S(t n )x — a;||E — ► as n — ► 00. □ 

Since the eigenvectors e n are contained in each of the spaces Ti a , we can 
consider n n , as defined by (3.2), to be an operator between any two of the 
spaces E, E* ,Ti and Ti a for all a G 1. In the sequel, we will simply write II n 
for all of these operators. Taken from Ti to Ti, this operator is self-adjoint. 
The adjoint of the operator fl n from E to E is just the II n we obtain by 
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using (3.2) to define an operator from E* to E* . Therefore, in our notation, 
we never need to write tl* N . As a consequence of Lemma 3.1, the operators 
Il n are uniformly bounded from E* to E* . 

Denote the space of bounded, continuous functions from E to R by Cb(E). 
We state and prove a modified version of [25], Theorem 2. 

Theorem 3.2. Suppose that assumptions (Al), (A2) are satisfied. Let 
Q be a positive definite, self-adjoint operator on TL , let U : E — > R be bounded 
from above and Frechet-differentiable and, for n G N, let (P")t>o be the 
semigroup on Cb(E) which is generated by the solutions of 

(3.3) dx(t) = g n (Cx + F n (x(t))) dt + V2g^ 2 U n dw, 

where U n = U o Ii n , F n = U' n , Q n = Ii n QIi n and w is a cylindrical Wiener 
process. Define the measure fi by 

dn(x)=e u W du{x), 

where v = M(0, — C^ 1 ). Let (Pt)t>o be a semigroup on C\,{E) such that 
P[ l ip(x n ) — ► P t ip(x) for every ip G Cb(E), for every sequence (x n ) with x n G 
E n and x n — > x G E and for every t > 0. The semigroup (Pt)t>o is then 
\i- symmetric. 

Proof. From [25], Theorem 1, we know that the stationary distribution 
of z is v and, from the finite dimensional theory, we know that (3.3) is 
reversible with a stationary distribution fi n which is given by 

dn n (x) = c n e Un(x) du n (x), 

where v n = v o II" 1 and c n is the appropriate normalization constant. Thus, 
for all continuous, bounded <p, ip : E — > R, we have 

/ ip(x)Pl l il;(x)diJ,n(x) = / ■ip(x)Pj l (p(x)dfj, n (x) 
Je Je 

and substitution gives 

/ ip(U n x)P t n ^(U n x)e u ^ du{x) 

JE 

= f ^{Ii n x)P^{Ii n x)e u( ^^ du{x) 
Je 

for every t > and every n G N. 

As in the proof of [25], Theorem 2, we obtain H n x — > x in E for z/-a.a. 
x. Since U is bounded from above and continuous and <p,ip £ Cb(E), we can 
use the dominated convergence theorem to conclude 

/ ip(x)P t iP{x)e u ^ dv{x) = [ iP{x)P t cp{x)e u ^ dv(x). 
Je Je 

This shows that the semigroup (Pt)t>o is /i-symmetric. □ 
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3.1. The nonpreconditioned case. We will apply Theorem 3.2 in two dif- 
ferent situations, namely for Q = I (in this subsection) and for Q ~ — C^ 1 
(in the next subsection). The case Q = I is treated in [25], Proposition 5 and 
[25], Theorem 4. Since, in the present text, we allow the nonlinearity U' to 
take values in E* instead of H, we repeat the (slightly modified) result here. 

Lemma 3.3. For n G N, let F n ,F:E — > E* , T>0 and tp n ,ip: [0, T] — >■ E 
be continuous functions such that the following conditions hold: 



• for every r > 0, there exists a K r > such that \\F n (x) — F n (y)\\E* <■ 
i^ r ||x — u\\e for every x,y G E with \\x\\e, \\v\\e < r an d every n G N; 

• F n (x) — > F(x) in E* as n — > oo for every x G E; 

• ipn ^ ip in C([0, T],E) osn-> oo; 

• there exists a p > 1 with 



(3.4) 



f T \\S(s)\\ p E ^ E ds <oo. 
Jo 



Let u n , u : [0, T] — > E be the solutions of 



(3.5) 



Unit) 



Sit- s)F n (u n {s))ds + ipnit), 



(3.6) u(t)= / S(t-s).F(u(s))ds + V(t)- 

Jo 

Then, u n ^>u in C([0, T], E). 
Proof. We have 



\u n (t) - u 



< 



S(t-s)(F n (u(s))-F(u{s)))ds 



+ 



S(t-s)(F n (u n (s))-F n (u( S )))ds 

+ \\Mt)-W)\\E 
= h(t) + hit) + hit) 

for all t G [0, T]. We can choose q > 1 with 1/p + 1/q = 1 to obtain 



/i(*)< / \\S(t-s)(F n (u{s))-F(u(s 



< 



< 



\F n (u(s)) - F(u(s) 



?* ds 



\\S{t-s)\\ E ^ E ds 



i/p 



\F n iu(s)) - F(u(s))\\ Et ds 



V'/ 
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By dominated convergence, the right-hand side converges to uniformly in 
t as n — > oo. 

For n £ N and r > 0, define 

r nir = inf{tG [0,T]\\\u{t)\\ E >r or \\u n (t)\\ E > r}, 

with the convention that inf = T. For t < r n)T . we have 

J 2 (i) < K r T ||5(t - s)|b.^K(s) - u(s)||j5da 
Jo 

and, consequently, 

\\u n {t) - u{t)\\ E < SUp /l(*) + ||^n(t)-^(*)lb 
0<t<T 

+ K r [ \\S(t - s)\\ E *->E\\un(s) - u(s)\\ E ds. 
Jo 

Using Gronwall's lemma, we can conclude that 

\\u n (t) - u(t)\\ E < ( sup h(t) + ||^ n (t) - il>(t)\\ E ) 

\0<t<T / 

xexp(K r J^ \\S(s)\\ E *^ E ds^ 

for all t < T ntr . 

Now, choose r > such that sup 0<t<T < r/2. Then, for suffi- 

ciently large n and all t < r nr , we have ||u n (i) — — r /2 an d thus 

sup 0<t<T ||it(t)||E < r. This implies that r njr = T for sufficiently large n and 
the result follows. □ 

With all of these preparations in place, we can now show that the measure 
fj, is a stationary distribution of the nonpreconditioned equation. The proof 
works by approximating the infinite-dimensional solution of (2.1) by finite- 
dimensional processes. Lemma 3.3 then shows that these finite dimensional 
processes converge to the solution of (2.1) and Theorem 3.2 finally shows 
that the corresponding stationary distributions also converge. 

Theorem 3.4. Let U : E — > M be bounded from above and Frechet differ- 
entiable. Assume that C and F = U' satisfy assumptions (Al)-(A4). Define 
the measure \i by 

(3.7) d^{x) = ce u{x) dv{x), 

where u = M(0, — £ _1 ) and c is a normalization constant. Then, (2.1) has a 
unique mild solution for every initial condition xq £ E and the corresponding 
Markov semigroup on E is pi-symmetric. In particular, fx is an invariant 
measure for (2.1). 
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Proof. Let x e E. From Theorem 2.6, the SDE (2.1) has a mild so- 
lution x starting at xq. Defining ip(t) = S(t)xo + z(t), where z is given by 
(3.1), we can a.s. write this solution in the form (3.6). Now, consider a se- 
quence (xq) with Xq £ E n for all n G N and Xq — > xo as n — > oo. Let Q = I. 
Then, for every n £ N, the finite-dimensional equation (3.3) has a solution 
x n which starts at Xq and this solution can a.s. be written in the form (3.5), 
with ij) n = S(t)xQ + z n (t) and z n = IL n z. From [25], Proposition 1, we get 
that z n — > z as n — ► oo and thus ip n — > ip in C([0, T],E) as n — > oo. 

Define F n as in Theorem 3.2. We then have F n {x) = iI n F(n n x) and thus 
F n (x) — > F(x) as n — > oo for every x £ E. Also, since F is locally Lipschitz, 
and Il n :E ^> E and Il n :E*^E* are uniformly bounded, the -F n are locally 
Lipschitz, where the constant can be chosen uniformly in n. From (2.3), we 
obtain 



for every T > and thus condition (3.4) is satisfied. We can now use Lemma 
3.3 to conclude that x n — > x in C([0,T],E) as n — > oo almost surely. Using 
dominated convergence, we see that PJ l ip{x n ) — ► P t (p(x) for every ip £ C^E) 
and every t > 0, where (-P™) are the semigroups from Theorem 3.2 and 
(Pt)t>o is the semigroup generated by the solutions of (2.1). We can now 
apply Theorem 3.2 to conclude that (Pt)t>o is /^-symmetric. □ 

3.2. The preconditioned case. For the preconditioned case, we require the 
covariance operator Q of the noise to satisfy assumptions (A6) and (A7), in 
particular for Q to be trace class. Thus, we can use strong solutions of (3.3) 
here. The analogue of Lemma 3.3 is given in the following lemma. 

Lemma 3.5. Let T > and, for n £ N, let C n , C be bounded operators on 
E and let F n ,F : E — > E as well as ip n , ip : [0, T] — > E be continuous functions 
such that the following conditions hold: 

• C n x — > Cx and F n (x) — > F(x) in E as n — > oo for every x £ E; 

• for every r > 0, there is a K r > such that 




(3.8) ||F n (x) - F n {y)\\ E < K r \\x - y\\ E 



for every x,y £ E with ||x||_g, \\y\\E < f an d every n £ N; 
• VVi — > ^ in C([0,T], E) as oo. 

Let u n ,u: [0, T] — > E be solutions of 



(3.9) 




(3.10) 
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then u n ^u in C([Q,T],E). 

Proof. We have 

\\Un(t) ~ u(t)\\ E 

< [ \\C n u{s) - Cu{s) + F n (u{s)) - F(u{s))\\ E ds 
Jo 

+ [ \\C n u n (s) - C n u(s) + F n (u n (s)) - F n (u(s))\\ E ds 
Jo 

+ w^ n (t)-m\\E 
=i 1 (t)+i 2 {t)+i 3 (t) 

for all t £ [0, T]. By the uniform boundedness principle, we have sup neN ||£/n||-E < 
oo and thus we can choose K r sufficiently large to obtain 

\\C n x - C n y + F n (x) -F n (y)\\ E <K r \\x-y\\ E 

for every x,y £ E with \\x\\ E , \\y\\ E < r and every n £ N. We also have 

sup /i(i)< / \\C n u(s) - Cu(s) + F n (u(s)) - F(u(s))\\ E ds — >0 

0<t<T Jo 

as n — > oo, by dominated convergence. 
For n £ N and r > 0, define 

r„, r = inf{i G [0,T]|K<)||b > r or \\u n (t)\\ E > r}, 

with the convention that inf = T. For t < r n)r , we have 

h(t)<K r [ \\u n (s) -u(s)\\ E ds 
Jo 

and, consequently, 

\\u n (t)-u(t)\\ E < sup h(t) + K r ( \\u n (s)-u{s)\\ E ds+\\tp n {t)-ip(t)\\ E . 

0<t<T JO 

Using Gronwall's lemma, we can conclude that 

\\u n (t)-u(t)\\ E <e KrT ( sup Ii(t) + ||y> n (i)-^(i)||j 

\0<t<T 

for all t < T nir . 

Now, choose r > such that sup < t <T' ||«(f)||.E < r/2. For sufficiently large 
n and all t < r njr , we then have ||ti n (£) — W (*)ll-E < r/2 and thus 
su Po<t<T im^H-E < r. This implies that r njT . = T for sufficiently large n and 
the result follows. □ 
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The following theorem shows that the measure is now also a stationary 
distribution of the preconditioned equation. Again the proof works by ap- 
proximating the infinite-dimensional solution of (2.1) by finite-dimensional 
processes. 

Theorem 3.6. Let U:E^>M be bounded from above and Frechet dif- 
ferentiable. Assume that the operators Q and C and the drift F = U' satisfy 
assumptions (Al)-(A3), and (A5)-(A7). Define the measure \x by 

(3.11) dfi(x) = ce u{x) diy(x), 

where u=M(0, — £ _1 ) and c is a normalization constant. Equation (2.14) 
then has a unique strong solution for every initial condition xq G E and 
the corresponding semigroup on E is fi- symmetric. In particular, [i is an 
invariant measure for (2.14). 

Proof. Let xq G E. From Theorem 2.10, SDE (2.14) has a strong so- 
lution x starting at xq. Defining ip(t) = x$ + w(t), where w = Q l / 2 w is a 
(/-Wiener process, we can a.s. write this solution in the form (3.10). Now, 
consider a sequence (xq) with Xq G E n for all n G N and Xq — > x$ as n — > oo. 
For every n G N, the finite-dimensional equation (3.3) then has a solution 
x n which starts at Xq and this solution can a.s. be written in the form (3.9), 
with 4) n = x% + ti n g l / 2 w{t). Since the function ip is continuous, it can be 
approximated arbitrarily well by a piecewise affine function ip. Since the op- 
erators ti n are equibounded in E and satisfy Tl n y — > y for every y G E, it is 
easy to see that ti n ip — > ip in C([0,T],E). On the other hand, \\ip n — Il n ip\\E 
is bounded by 1 1 tl n xo — Xq \\ e + || Iln 1 1 e^e \ \ i> — i> \ \ E , so it also gets arbitrarily 
small. This shows that vjj n indeed converges to ip in C([0,T],E). 

Because of (A6) and (A7), we have < oo and < oo. 

Let F = U' and define F n and Q n as in Theorem 3.2. We then have F n (x) = 
Tl n F(n n x). Let C n = g n C = fl n gCfl n , £ = GC, F n = Q n F n and F = QF. 
Since Uriralls-^i; < c for all u£N and some constant c < oo and since ||II n 2; n — 
< ||n ra x n — n n x||£; + ||II n 2; — x\\e, we have II n x n — > x in E as n — ► oo 
for every sequence (x n ) with x n — > x in E. Since QC is a bounded operator 
on E, we can use this fact to obtain C n x — > Cx in E as n — > oo for every 
x G E. Since QL is bounded from E to E and C(E) 5 £(H 1/2 ) = H~ 1/2 , the 
operator Q is defined on all of E* C 7i~ 1 / 2 and thus bounded from E* to 
E and we obtain F n (x) — ► F(x) in E as n — > oo for every x € E. Since F 
is locally Lipschitz and the H n are uniformly bounded, both as operators 
from E to E and from E* to E* , the i 7 ^ are locally Lipschitz, where the 
constant can be chosen uniformly in n. Therefore, all of the conditions of 
Lemma 3.5 are satisfied and we can conclude that x n — > x in C([0,T], E) as 
n — > oo almost surely. 
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Using dominated convergence, we see that P™ip(x n ) — > Pt(f(x) for every 
ip € Cb(E) and every t > 0, where (P™) are the semigroups from Theorem 
3.2 and (Pt)t>o is the semigroup generated by the solutions of (2.1). We can 
now apply Theorem 3.2 to conclude that (Pt)t>o is /u-symmetric. □ 

4. Ergodic properties of the equations. In this section, we show that the 
measure \x from Theorems 3.4 and 3.6 is actually the only invariant measure 
for both (2.1) and (2.14). This result is essential to justify the use of ergodic 
averages of solutions to (2.1) or (2.14) in order to sample from \i. We also 
show that a weak law of large numbers holds for every (and not just almost 
every) initial condition. Theorems 4.10 and 4.11 summarize the main results. 

These results are similar to existing results for (2.1), although our frame- 
work includes nonlinear boundary conditions and distribution- valued forcing 
in the equation. Furthermore, our analysis seems to be completely new for 
(2.14). The problem is that (2.14) does not have any smoothing property. 
In particular, it lacks the strong Feller property which is an essential tool in 
most proofs of uniqueness of invariant measures for SPDEs. We show, how- 
ever, that it enjoys the recently introduced asymptotic strong Feller property 
[12], which can, in many cases, be used as a substitute for the strong Feller 
property, as far as properties of the invariant measures are concerned. 

Recall that a Markov semigroup Vt over a Banach space is called strong 
Feller if it maps bounded measurable functions into bounded continuous 
functions. It can be shown by a standard density argument that if Assump- 
tion 1 holds for Vt, then it also has the strong Feller property. We will not 
give the precise definition of the asymptotic strong Feller property in the 
present article since this would require some preliminaries that are not going 
to be used in the sequel. All we will use is the fact that, in a similar way, if a 
Markov semigroup Vt satisfies Assumption 2, then it is also asymptotically 
strong Feller. 

4.1. Variations of the strong Feller property. Given a Markov process on 
a separable Banach space E, we call Pt the associated semigroup acting on 
bounded Borel measurable functions (p : E — ► R. Let us denote by Cl(E) the 
space of bounded functions from E to R with bounded Frechet derivative. 
For the moment, let us consider processes that satisfy the following property. 

Assumption 1. The Markov semigroup P t maps Cl(E) into itself. Fur- 
thermore, there exists a time t and a locally bounded function C :E — > R_|_ 
such that the bound 

(4.1) ||DP t¥ >(s)||<C(x)H|oo 
holds for every tp : E — > R in C\(E) and every x € E. 
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It is convenient to introduce 

(4.2) B(x) = {yeE\\\y-x\\ E <l}, C(x)= sup C(y). 

yeB(x) 

Note that a density argument given in [6] shows that if (4.1) holds for Frechet 
differentiable functions, then P^ip is locally Lipschitz continuous with local 
Lipschitz constant C(a;)||(/j|| 00 for every bounded measurable function (p. In 
particular, this shows that 

(4.3) \\P t (x, •) - P t (y, -)||tv < hC(x)\\x - y\\ E 

for every x,y £ E with \\x — y\\ E < 1 (with the convention that the total 
variation distance between mutually singular measures is 1). Recall that 
the support of a measure is the smallest closed set with full measure. We 
also follow the terminology in [6, 23] by calling an invariant measure for a 
Markov semigroup ergodic if the law of the corresponding stationary process 
is ergodic for the time shifts. The following result follows immediately. 

Lemma 4.1. Let Pt be a Markov semigroup on a separable Banach space 
E that satisfies (4.3) and let // and v be two ergodic invariant measures for 
Pt- If n^v, then we have \\x — y\\ > min{l, 2/C(x)} for any two points 
(x,y) G supp/U x suppiA 

Proof. Assume (for the purposes of obtaining a contradiction) that 
there exists a point (x,y) G supp/z x suppf with \\x — y\\ < 2/C(x) and 
|| a; — y\\ < 1. Let 5 <1 — \\x — y\\ be determined later and let Bs(x) denote 
the ball of radius 5 centered at in x. With these definitions, it is easy to 
check from (4.3) and the triangle inequality that we have 

\\p t ( x ', •) - p t ( y >, .)||tv < U 2S + \\ x - y\\) c ( x ) 

for every x' E B$(x) and y' G Bs{y). Since we assumed that \\x — y||C(x)/2 < 
1, it is possible, by taking 5 sufficiently small, to find a strictly positive a > 
such that 

\\P t (x',-)-Pt(y',-)\\TV<l-a. 
The invariance of ^ and v under Pt implies that 

||/i-z/|| T v < / \\Pt(x,-) ~ Pt{y,-)\WvlJ>{dx)v(dy) < 1 - a^{B & {x))v{Bs{y)). 
Je 2 

Since the definition of the support of a measure implies that both fj>(B$(x)) 
and i>(B${y)) are nonzero, this contradicts the fact that fi and v are distinct 
and ergodic, therefore mutually singular. □ 

In our case, it turns out that we are unfortunately not able to prove 
that (4.1) holds for the equations under consideration. However, it follows 
immediately from the proof of Lemma 4.1 that we have the following, very 
similar, result. 
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Corollary 4.2. Let Pt be a Markov semigroup on a separable Banach 
space E such that there exists a continuous increasing function f : K + — > IR + 
with /(O) = 0, /(l) = 1 and 

(4.4) ||P t (x, •) - P t (y, .)||tv < C^/dlx - y||) 

/or every x,y £ E with \\x — y\\ < 1. Xei /i and ^ be two ergodic invariant 
measures for P t . If [i ^ v, then we have f(\\x — y||) > min{l, 1/C(x)} for 
any two points (x,y) 6 supp/U x suppz;. 

In Theorem 4.7 below, we will see that the semigroups generated by the 
nonpreconditioned equations considered in the present article satisfy the 
smoothing property (4.4). However, even the slightly weaker strong Feller 
property can be shown to fail for the semigroups generated by the precon- 
ditioned equations. They, however, satisfy the following, somewhat weaker, 
condition. 

Assumption 2. The Markov semigroup P t maps Cl{E) into itself. Fur- 
thermore, there exists a decreasing function / : M + — > K + converging to at 
infinity and a locally bounded function C : E — > such that the bound 

(4.5) \\DPMx)\\ < G^xXIMU + f(t)\\Df\\oo) 
holds for every tp : E — > R in C\(E) and every x G Ti.. 

A modification of the argument of Lemma 4.1 yields the following. 

Lemma 4.3. Let Pt be a Markov semigroup on a separable Banach space 
E that satisfies Assumption 2 and let \x and v be two ergodic invariant 
measures for Pf. If \l^v, then we have \\x — y\\ > min{l, 2/C(x)} for any 
two points (x,y) £ supp// x suppz/, where C is given in (4-2). 

Proof. Given a distance d on E, recall that the corresponding Wasser- 
stein distance on the space of probability measures on E is given by 

(4.6) ||7ri - ir 2 \\d = inf / d(x,y)7r(dx,dy), 

7reC(7ri,7r 2 ) JE 2 

where C(tt\, tt2) denotes the set of probability measures on E 2 with marginals 
7Ti and 7T2. 

Given the two invariant measures [i and v, we also recall the useful in- 
equality 

(4.7) \\n-v\\ d < l-min{n(A),u(A)}(l- max .\\P t (z,-) - P t (y,-)\\ d ), 
valid for every t > and every measurable set A (see, e.g., [12] for a proof). 
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For e > 0, we define on TL the distance d £ (x,y) = 1 A e" 1 ^ — y\\ and 
denote by || • || £ the corresponding seminorm on measures given by (4.6). It 
can be checked from the definitions that, in a way similar to the proof of 
[12], Proposition 3.12, (4.5) implies that the bound 



holds for every (x,y) 6 E 2 with ||x — y\\ < 1. Hence, the same argument as 
in the proof of Lemma 4.1 yields a > 0, so, for 5 sufficiently small, we have 
the bound 



for every x' G Bg(x) and y' G B$(y). Note that 5 can be chosen independently 
of e. Choosing t as a function of e sufficiently large so that f{t) < ae/2, say, 
it follows from (4.7) that 



for every e > 0. Since lim^o ||/ii — ^We = llw — /-^Htv (see [12]), the claim 
follows in the same way as in the proof of Lemma 4.1. □ 

4.2. Conditions for (4.4) to hold. In this subsection, we show that equa- 
tion (2.1), arising from the nonpreconditioned case, satisfies the bound (4.4). 
Our main result is the following theorem. 

The proof of the results is closely related to standard arguments that can 
be found, for example, in [4, 6, 17]. However, the situation in these works is 
different from ours, mainly because we only have local bounds on the deriva- 
tive of the flow with respect to the initial condition. This forces us to use an 
approximation argument which, in turn, only yields a bound of type (4.4) 
rather than the bound (4.1) obtained in the previously mentioned works. 
The present proof unfortunately requires (4.8) as an additional assumption 
on the nonlinearity F, even though we believe that this is somewhat artifi- 
cial. 

Theorem 4.4. Suppose that assumptions (Al)-(A4) hold. Assume, fur- 
thermore, that for every R > 0, there exists a Frechet differentiable function 
F R :E^E* such that 





fJ-i ~ A*2||e < 1 - « 2 min{/xi(B,5(x)), /j. 2 (B s (x))} 



F R {x) = F{x) 



for \\x\\ E < R, 



(4.8) 



Fr(x) = 



for \\x\\ E >2R 



and such that there exist constants C and N such that 



\\F R {x)\\ + \\DF R {x)\\<C{l + R) 



N 
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for every x G E. There then exist exponents N > and a > sitc/i t/iat 
i/ie solutions to the SPDE (2.1) satisfy (4.4) u>i£/i /(r) = r a and C(x) = 

(l + \\x\\ E f- 

PROOF. Fix xq£ E and define R = 2||xo||_b- Denote by & R : E —* E the 
flow induced by the solutions to the truncated equation 

(4.9) dx = Cxdt + F R (x)dt + V2dw(t). 

Further, denote by z the solution to the linearized equation defined in (2.4). 
It follows immediately from Picard iterations that & R is Frechet differen- 
tiate and that there exists a constant C such that 

\\*?(x)\\e < \\x\\ E + \\z(t)\\E + Ct l ~ 2a (l + R) N , 



(4.10) 

\\D^(x)\\ E ^ E <2 

for every t with 

(4.11) t l ~ 2a < 



C{1 + R) N ' 

Note that the bounds in (4.10) are almost sure bounds and that (4.11) is a 
deterministic condition on the time interval we are allowed to consider. 

Now, denote by the Markov semigroup generated by (4.9). For an 
arbitrary function ip G C\{E) and an arbitrary vector ^ G E, the Bismut- 
Elworthy-Li formula [6, 9] yields 

\DP t R <p(x)£\ = 7 E (Wf (*)) f\D$?(x)£,dw(s)) 



t V 

<±y\\^(K J* \\D$?(x)zf H dsy 2 . 

Combining this with (4.10) shows that there exists a constant C such that 

(4.12) \\P t R (x, •) - P t R (y, OUtv < ^=\\x - y\\ E , 

provided that t is sufficiently small that (4.11) holds. The bound (4.10) 
shows that there exists 6 > such that 

/ \ n+d 

(4.13) P sup \\ X ( S )\\ E >R)< — 

\se[o,t] / R 

for every t such that (4.11) holds and every xq such that ||xo||e < -R/2. 

Furthermore, it is clear that the solution to (4.9) agrees with the solution 
to (2.1), provided it stays inside a ball of radius R, so (4.13) implies that, 
under the same conditions, we have 

(4-14) \\P t (x,.)-P t R (x,.)\\ Ty <—. 
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for s <t, (4.15) actually implies that 



and all times 
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C Ct e 
(4.15) \\P t (x, •) - P t (y, -)||tv < -j=\\x ~ v\\e + 

for all pairs (x,y) € E x E such that sup{||a;||jg, — R/2 

t satisfying (4.11). Since we have 

||P t (x, •) - P t (y, OIItv < \\P s (x, •) - P s (y, OIItv 



/ (j Cs \ 

\\P t (x, ■) - P t (y, -)Htv < mf (^=||x - y\\ E + -^-J , 

which immediately yields that a bound of the type (4.4) holds, with C(x) 
growing polynomially in ||^||b- D 

Corollary 4.5. Let U :E — >M. be bounded from above and Frechet dif- 
ferentiable. Assume that C and F = U' satisfy the assumptions of Theorem 
4-4- The SDE (2.1) then has a unique stationary distribution, which is given 
by (3.7). 

Proof. Denote by £ the set of all ergodic invariant measures for (2.1). 
It follows from Theorem 3.4 that fx, as given by (3.7), is an invariant mea- 
sure for (2.1), so £ is not empty. Also, note that the support of fi is equal 
to E since the embedding Ti 1 / 2 C E is dense by (A2). Assume, now, that 
£ contains at least two elements, v\ and V2- In this case, it follows from 
Theorem 4.4 that there exists an open set Ac E such that A n suppz/ = 
for every v G £. Since every invariant measure is a convex combination of 
ergodic invariant measures ([23], Theorem 6.6), this implies that (J>(A) = 0, 
which contradicts to the fact that supp/i = E. □ 

Remark 4.6. Since we obtain the strong Feller property for (2.1), as well 
as the existence of a Lyapunov function [see equation (2.16)], we can apply 
the machinery exposed in [18] in order to obtain the exponential convergence 
(in a weighted total variation norm) of transition probabilities to the unique 
invariant measure. The only additional ingredient that is required is the fact 
that the level sets of the Lyapunov function are "small." This can be checked 
by a standard controllability argument. 

4.3. Conditions for (4.5) to hold. In this subsection, we show that the 
equations arising from the non-preconditioned case satisfy a bound of the 
type (4.5). 
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Theorem 4.7. Let C, F and Q satisfy (A1)-(A3) and (A5)-(A7). The 
Markov semigroup on 7i generated by the solutions of (2.14) then satisfies 
the bound (4.5), with C(x) < C(l + for some constants C and N . 

In particular, it is asymptotically strong Feller. 

Remark 4.8. Note that it is not generally true that these assumptions 
imply that the process is strong Feller. A counterexample is given by the 
case where C is minus the identity, F = and Q : TL — ► TL is any positive 
definite trace class operator. This counterexample comes very close to the 
situation studied in this paper, so the strong Feller property is clearly not 
an appropriate concept here. 

Proof of Theorem 4.7. It follows from standard arguments that the 
evolution map $ Sj f :ExQ^E is Frechet differentiable. In the sequel, we 
denote its Frechet derivative by J s j. 

The family of (random) linear operators J s j ■ E — > E is given in the fol- 
lowing way. For every £ & E, J s> t£ solves the equation 

d t J s 4 = £ J s , t £ + DF(x(t))J S)t £, J s ,sC = £■ 
We also define a family of (random) linear operators At :L 2 ([0, t],TC) — ► E 

by 

A t v= [ J s , t g^ 2 v(s)ds. 
Jo 

This is well defined since Q 1 / 2 maps 7i into E by Lemma 2.8. Recall that 
Atv is the Malliavin derivative of the flow at time t in the direction of the 
Cameron-Martin vector v. We will also denote this by Atv = T> v Qq^- 

Given a perturbation £ in the initial condition for x, the idea is to find 
a perturbation v in the direction of the Cameron-Martin space of the noise 
such that these perturbations "cancel" each other for large times t. Given a 
square-integrable TL- valued process v, we therefore introduce the notation 

p(t) = J ,t£ - A t v [0it] , 

where vj denotes the restriction of v to the interval J. Note that p{t) is the 
solution to the differential equation 

(4.16) d t p{t) = Cp{t) + DF{x{t))p{t)-g 1 ' 2 v{t), p (0)=^E. 

The reason for introducing this process p is clear from the approximate 
integration by parts formula (see [12] for more details), which holds for 
every bounded function ip : E — > R with bounded Frechet derivative: 

{DPMx),£)=E((D(<p(x i )),S))=E((D<p)(x t )J 0i tt) 
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= E((D<p)(xt)A t v m ) + E((Dcp)(x t )p t ) 
(4.17) =E(V v M<p(x t ))+M((D<p)(x t )p t ) 

= E^(xt) J Q Hs), dw(s))j + E((D<p)(xt)pt) 




In this formula, it; denotes a cylindrical Wiener process on Ti, so w = Q l l 2 w. 
This formula is valid for every adapted square integrable W-valued process 
v. 

It remains to choose an adapted process v such that p(t) — ► 0. For 
v(t) = g- 1 / 2 (DF(x(t)) + lC)e- t £, 

it is easy to check that equation (4.16) reduces to dtp = —p, so ||p(i)||s = 
e~ *. Furthermore, Theorem 2.10, together with assumptions (A3) and (A7), 
ensures that E||u(i)||^ < C(l + HxHs)^ - ™* for some constants C, N and 
w, so (4.17) immediately implies (4.5). □ 

Corollary 4.9. Let U:E^M be bounded from above, Frechet differ- 
entiable and such that (A1)-(A3) and (A5)-(A7) hold for F = U' . The SDE 
(2.14) then has a unique stationary distribution, given by (3.11). 

Proof. The proof follows exactly the same pattern as the proof of 4.5, 
but we replace references to Theorem 4.4 by references to Theorem 4.7 and 
4.3. □ 



4.4. Law of large numbers. In this section, we use the results of the 
previous section in order to show that the solutions to our equations satisfy 
a law of large numbers. We first introduce the following result. 

Theorem 4.10. Assume that (Al)-(A4) and (4.8) hold and let p be an 
ergodic invariant probability measure for (2.1). We then have 

(4.18) lim — [ ip(x(t))dt= [ ip(x)p(dx) almost surely 

T^oo T Jo Je 

for every initial condition xq in the support of p and for every bounded 
measurable function ip-.E^M.. 

Proof. Denote by A C E the set of initial conditions for which (4.18) 
holds and by S the support of p. We know from Birkhoff 's ergodic theorem 
that p(A) = 1 and therefore that A is dense in 5. Now, let xo £ S and e > 
be arbitrary and choose a sequence of points Xq in A converging to xq. 
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Fix an arbitrary time to > 0. Since, by Theorem 4.4, Pt(x, •) is continuous 
in x (in the topology of total variation), there exists n such that 

(4-19) ||P to (^,-)-Pto(^o,-)||TV<£- 

Let x n {-) denote the trajectories starting from Xq and x(t) denote the tra- 
jectories starting from xq. By the Markov property, the bound (4.19) implies 
that there exists a coupling between the laws of x n {-) and x(-) such that, 
with probability larger than 1 — e, we have x n (t) = x(t) for every t > to- This 
immediately shows that 

lim ^ f ip(x(t))dt = [ y{x)n{dx) 

on a set of measure larger than 1 — e. Since e was arbitrary, the desired 
result follows. □ 

In the preconditioned case, we have the following, somewhat weaker, form 
of the law of large numbers. 

Theorem 4.11. Assume that (A1)-(A3) and (A5)-(A7) hold and let /x 
be an ergodic invariant probability measure for (2.14). We then have 

(4.20) lim — [ ip(x(t))dt = [ (p(x)/J,(dx) in probability 
T-^oo T Jo Je 

for every initial condition x(0) in the support of [i and for every bounded 
function ip: E — > R with bounded Frechet derivative. 

Proof. As before, denote by A E E the set of initial conditions for which 
(4.20) holds and by S the support of ^. Since convergence in probability 
is weaker than almost sure convergence, we know from Birkhoff's ergodic 
theorem that fj,(A) = 1 and therefore that A is dense in S. 

Define 



1 f T 

S$(x) = -J <p(x(t)) dt with x(0) 



x. 



The idea is to use the following chain of equalities, valid for every pair of 
bounded functions ip : E — > R and tp : R — > R with bounded Frechet deriva- 
tives. The symbol D denotes the Frechet derivative of a given function and 
the symbol T> denotes its Malliavin derivative. We have 

rT 



DE^(x)^ = E((D^)(£^ J o (D<p)(x(t))Jo,ttdt 



:E(2?>(^))+E( {Dip)^)- I {Dv)(x(t))p{t)dt 



T 
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Now, denote by ^{x) the law of £^(x). The above chain of inequalities 
shows that 

ll^)-^)lk<c(i + ^^)|| a; - y || 

for some constant C, where || • \\w denotes the Wasserstein distance between 
two probability measures with respect to the distance function 1 A \\x — 
y\\. Since the Wasserstein distance metrizes the weak convergence topology 
and weak convergence to a delta measure is the same as convergence in 
probability to the point at which the measure is located, this implies that 
(4.20) holds for every initial condition x in S. □ 



Remark 4.12. It is possible to extend the above argument to a larger 
class of continuous test functions (p by introducing a time-dependent smooth- 
ing (and possibly cu-toff). 



Remark 4.13. If we wish to obtain a statement which is valid for every 
initial condition, it is, in general, impossible to drop the continuity assump- 
tion on ip. Consider, for example, the trivial dynamic x = —x on IR with 
invariant measure 5q. It is obvious that if we take xq = 1, y(0) = 1 and 
tp(x) = for x / 0, then the left-hand side of (4.20) is 0, whereas the right- 
hand side is 1. 



5. Conditioned SDEs. In this section, we outline how the preceding ma- 
terial can be used to construct SPDEs which sample from the distribution 
of conditioned SDEs. The program outlined here will be carried out in the 
subsequent sections in three specific contexts. 

We start the section by explaining the common structure of the arguments 
used in each of the following three sections; we also outline the common tech- 
nical tools required. We then make some remarks concerning the conversion 
between Hilbert-space-valued SDEs and SPDEs and, in particular, discuss 
how the framework developed in preceding sections enables us to handle the 
nonlinear boundary conditions which arise. 

Consider the following Revalued SDEs, both driven by a <i-dimensional 
Brownian motion, with invertible covariance matrix BB*: 



(5.1) 



dX = AXdu + f{X) du + B dW, 



X(0)=x~ 
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and 

(5.2) dZ = AZ du + B dW, Z(0)=x~. 

Our aim is to construct an SPDE which has the distribution of X, possibly 
conditioned by observations, as its stationary distribution. The construction 
consists of the following steps. We symbolically denote the condition on X 
and Z by C here and we set m(u) = E(Z(u)|C). 

1. Use the Girsanov formula (Lemma 5.2 below) to find the density of 
the distribution C{X) w.r.t. C(Z). 

2. Use results about conditional distributions (Lemma 5.3 below) to de- 
rive the density of the conditional distribution C(X\C) w.r.t. C(Z\C). Using 
substitution, this gives the density of the shifted distribution C(X\C) — m 
w.r.t. the centered measure C(Z\C) — m. 

3. Use the results of the companion paper [14] to obtain an L 2 -valued 
SDE which has the centered Gaussian measure C(Z\C) — m as its station- 
ary distribution. This also gives a representation of m as the solution of a 
boundary value problem. 

4. Use the results of Sections 2 and 3 and the density from step 2 to 
derive an C([0, l],R rf )-valued SDE with stationary distribution C(X\C) — m. 
Use the results of Section 4 to show ergodicity of the resulting SDE. 

5. Write the L 2 -valued SDE as an SPDE, reversing the centering from 
step 2 in the process. 

Combining all of these steps leads to an SPDE which samples from the 
conditional distribution C(X\C) in its stationary measure. In the remaining 
part of this section, we will elaborate on the parts of the outlined program 
which are common to all three of our applications. 

We will assume throughout the rest of this article that the drift / for X 
is of the form / = —BB*W, where the potential V satisfies the following 
polynomial growth condition. 

(M) The potential V : M. d — > R is a C 4 -function which can be written as 

V{x) = M{x,...,x) + V(x), 
where M : (R d f p -> R is 2p-linear with 

M(x,...,x) >c\x\ 2p Vx G M. d 
for some pSN and c > 0, and V : M. d — > R satisfies 
\D k V(x)\ ^ , , _ 



for every /c-fold partial derivative operator D k with k = 0, ... ,4. 
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Under condition (M), the potential V is bounded from below and grows like 
\x\ 2p as |sc| — ► oo. From [16], Section 2.3, Theorem 3.6, we know that under 
this condition on /, the SDE (5.1) has a unique nonexploding solution. 

Later, when checking assumption (A4) and the boundedness of U in The- 
orem 3.4, we have to estimate terms which involve both the nonlinearity / 
and the linear part A of the drift. If condition (M) is satisfied for p > 1, we 
will get the estimates from the superlinear growth of /. For p = 1, we use 
the following, additional assumption on A. 

(Q) For p = 1, the matrices A, B from (5.1) satisfy QA + A*Q - QBB*Q < 
(as a symmetric matrix), where Q € M. dxd is the symmetric matrix 
defined by the relation M(x, x) = \{x, Qx) for all x G M. d . 

Notation 5.1. Introduce the inner product and related norm 
{a,b) B = a*(BB*)~ 1 b, \a\ 2 B = {a,a) B , 
defined for any invertible B. 

The densities in step 1 above will be calculated from the Girsanov formula. 
As an abbreviation, let 

(5.3) $ = i(|/|| + div/). 

When expressed in terms of V, this becomes 

$ = \(\B*W\ 2 - (BB*) : D 2 V), 

where ":" denotes the Frobenius inner product and D 2 V denotes the Hessian 
of V. 



Lemma 5.2. Assume that (5.1) has a solution without explosions on the 
interval [0, 1]. Let Q (resp. P) be the distribution on path space C([0, l],M rf ) 
of the solution of (5.2) [resp. (5.1)/. Then, 

dP(Z) = -^dQ(Z), 

where 

ln<p(Z) = - f 1 (f(Z(u)),odZ(u)) B + [ (<S>(Z(u)) + (f(Z(u)),AZ(u)) B )du. 
Jo Jo 

Proof. Since X (by assumption) and Z (since it solves a linear SDE) 
have no explosions, we can apply Girsanov's theorem ([8], Theorem 11A) 
which yields 

]n<p(Z) = - ^(B^fiZiu^dWiu))- f ' \\f{Z{u))\ 2 B du. 
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But, 
•i 



(B- l f{Z{u)),dW{u)) 

•l r l 



(f(Z(u)),dZ(u)) B -(f(Z(u),AZ(u)) B du- / \f(Z(u))\ B du. 
J 

Converting the first integral on the right-hand side to Stratonovich form 
gives the desired result. □ 

Writing the Radon-Nikodym derivative in terms of a Stratonovich integral 
in the lemma above is helpful when studying its form in the case of gradient 
vector fields; the stochastic integral then reduces to boundary contributions. 

We will handle the conditioning in step 2 of the program outlined above 
with the help of Lemma 5.3 below. We will use it in two ways: to condi- 
tion on paths X which end at X(l) = x + and, for the filtering/smoothing 
problem where X will be replaced by a pair (X, Y), to condition the signal 
(X(it)) ue [ 0> i] on the observation (Y(u)) u£ [ ^ . Since the proof of the lemma 
is elementary, we omit it here (see Section 10.2 of [7] for reference). 

Lemma 5.3. Let P,Q be probability measures on S x T, where (5,-4.) 
and (T, B) are measurable spaces, and let X : S x T — > S and Y :S x T — > T 
be the canonical projections. Assume that P has a density ip w.r.t. Q and 
that the conditional distribution Q x \y=y exists. The conditional distribution 
~Px\Y=y then exists and is given by 



(5.4) 




dP x \Y= y ^ _ J — (p(x,y), ifc(y)>0, 

dQx\Y=y 



otherwise, 



with c(y) = J s (p(x, y) dQ x \ Y=y (x) for all y€T. 



The linear, infinite-dimensional SDEs from [14] which we will use in step 
3 are defined on the space TL = L 2 ([0, l],M d ) and the generator of the corre- 
sponding semigroup is the self-adjoint operator C on TC which is the exten- 
sion of the differential operator 

L = (d u + A*)(BB*rHdu-A) 

with appropriate boundary conditions. When studying the filtering/smoothing 
problem, the operator L will include additional lower order terms, which we 
omit here for clarity. 

The nonlinear, infinite-dimensional SDEs derived in step 4 are of the 
form (2.1) or (2.14). They share the operator C with the linear equations, 
but have an additional nonlinear drift F :E — > E* , where the space E will 
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be a subspace of C([0, l],R d ). The main difficulty in step 4 is to verify that 
assumptions (A1)-(A4) for the nonpreconditioned case or (Al)-(A3), (A5)- 
(A7) for the preconditioned case hold under conditions (M) and (Q). The 
nonlinearity F is of the form 

(5.5) (F(lo))(u) = <p(u(u)) + ho(u(p))5(u) + hi(u(l))6(l - u) 

for all u E [0, 1], where ip, ho and hi are functions from R d to M. d . The symbols 
6(u) and 5(1 — u) denote Dirac mass terms at the boundaries. The functions 
ip, ho and hi are calculated from the potential V and, in our applications, 
the growth conditions from (A3) will be a direct consequence of condition 
(M). The following lemma, in conjunction with condition (M), will help us 
to verify assumption (A4). 

Lemma 5.4. Let c, 7 > and h : M. d — > M. d be continuous with (h(x),x) < 
~l\ x \ 2 f or every x € M. d with \x\ > c. Then, 

(u*,h(uj)) < — 7IHI00 Vw* G 9||cj||oo 

and for all continuous functions lu : [0, 1] — ► M. d such that \\u}\\oo — c - 

Proof. Using the characterization (2.8) of 9||u>||oo from the remark after 
(A4), we get 

(u\h{u))= f\h(u(u)),^-)\co*\(du) 
Jo \ I^Wl/ 

< -7 / \iv(u)\\LJ*\(du) = -7||u;||oo- 
J 

This completes the proof. □ 



Remark 5.5. The special choice of E, C and F allows us to rewrite the 
Hilbert-space- valued SDEs as Revalued SPDEs in step 5. We obtain SPDEs 
of the following form: 

d t x(t,u) = Lx(t,u) + g(u) + tp(x(t,u)) + h (x{t,0))5{u) + hi(x(t,l))S(l - u) 
+ V2d t w(t,u) V(t,u) € (0,oo) x [0,1], 



D o x(t,0) = a, Dix{t,l) = (3 Vte(0,oo), 

where (p,ho,hi are functions from M. d to M. d , dtw is space-time white noise, 
Di = Aid u + Bi are linear first-order differential operators and a, j3 £ M. d are 
constants. The term g is only nonzero for the filtering/smoothing problem 
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and is then an element of E* . Incorporating the jump induced by the Dirac 
masses into the boundary conditions gives 

D x(t, 0) = a - A (BB*)h (x(t,0)), 

D lX (t,l) =/3 + A x {BB*)hi (x(t,l)) Vie (0,oo). 

With these boundary conditions, the delta functions are removed from the 
SPDE above. 

We call a process x : [0, oo) x [0, 1] — > BL d a mild solution of this SPDE if 
x — m is a mild solution of the W-valued SDE (2.1), where m is a solution 
of the boundary value problem —Lm = g with Dom(0) = a and D\m(V) = (3 
and C is the self-adjoint operator L with boundary conditions Dquj(0) = 
and Diw(l) =0. 

Remark 5.6. When using the preconditioned equation (2.14), we will 
consider evolution equations of the following form: 

d t x(t,u) = -x(t,u) +y(t,u) + V2d t w(t,u) V(t,u)e (0,oo) x [0,1], 

-L y(t, u) = Lix(t, u) + g(u) + <p(x(t, u)) 

+ h (x(t, 0))S(u) + hi(x(t, l))ff(l - tt) 

V(t,u) G (0,oo) x [0,1], 

D y(t,0) = a, D lV (t,l) = P VtG(0,oo), 

where L = Lq + L\, Lq is a second-order differential operator, Li is a dif- 
ferential operator of lower order, is the inverse of —Lq subject to the 
same homogeneous boundary conditions as L and w is a <5- Wiener process. 
Incorporating the induced jump into the boundary conditions as above gives 

D y(t, 0) = a - A {BB*)h (x{t, 0)), 

D iy (t, 1) = (3 + Ai(BB*)/n(x(t, 1)) Vt G (0, oo). 

With these boundary conditions, the Dirac mass is removed from the evo- 
lution equation above. 

We call a process x : [0, oo) x [0, 1] — > M d a strong solution of this SPDE 
if x — m is a strong solution of the W-valued SDE (2.14), where m is a 
solution of the boundary value problem —L^m = g with Dom(0) = a and 
D\m(l) = (3 and C is the self-adjoint operator L = Lq + L\ with boundary 
conditions Dqu(Q) = and Diu(l) = 0. 
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6. Free path sampling. In this section, we will follow the program out- 
lined in Section 5 in order to construct SPDEs whose stationary distribution 
is the distribution of the solution X of the SDE (5.1). The main results are 
Theorems 6.1 and 6.3. We re-emphasize that it is straightforward to gener- 
ate independent samples from the desired distribution in this unconditioned 
case and there would be no reason to use the SPDEs in practice for this 
problem. However, the analysis highlights a number of issues which arise in 
the two following sections, in a straightforward way; we therefore include it 
here. 

We write C_([0, l],M rf ) for the set of all continuous functions from [0,1] 
to M. d with start at x~ . 

Theorem 6.1. Let A G R dxd be a matrix, let B G R dxd be invertible, 
let f = —BB*W, assume that conditions (M) and (Q) are satisfied and let 
x~ G R d . Consider the R d -valued SPDE 

d t x = (8 U + A^BB^idu - A)x - V<S>(x) 

(6.1a) 

- (Df)*(x)(BB*)" 1 Ax - A*{BB*Y l f{x) + V2d t w, 
(6.1b) x(t,0)=x~, d u x(t,l) = Ax(t,l) + f(x(t,l)), 
(6.1c) x(0, u) = xq(u), 

where dtw is space-time white noise and $ is given by (5.3). 

(a) This SPDE has a unique, mild solution for every xq G C_([0, 1],!^) 
and its stationary distribution coincides with the distribution of the solution 
of SDE (5.1). 

(b) For every bounded, measurable function (^:C_([0, l],R d ) — > M and ev- 
ery xq G C_([0, l],M d ) ; we have 

1 f T 

lim — / ip(x(t,-)) dt = E(ip(X)) almost surely, 

T^oo T Jo 

where X is the solution of (5.1). 

Proof. Let X be a solution of (5.1) and let Z be the solution of the 
linear SDE (5.2). From Lemma 5.2, we know that the distribution of X has 
a density (p with respect to the distribution of Z which is given by 

<p(to) = c ■ exp I —V(uj(1)) 

(6-2) 

+ / (VV(Lo(u)),Auj{u)}du- [ §{uj{u))du 
Jo Jo 

for all u G C([0,l],M d ) and some normalization constant c. Let m(u) = 
E(Z(u)) for all u G [0, 1]. The density ip of the distribution fi = C(X — m) 
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w.r.t. the centered distribution v = C{Z — m) is then given by ip(u — m) = 
<p(u) for all wGC([0,l],IR d ). 

Consider the Hilbert space H = L 2 ([0, l],R d ) and the Banach space E = 
{to G C([0, l],M d )|u;(0) = 0} C TL equipped with the supremum norm. Let 
C be the self-adjoint version of (d u + A*)(BB*)~ 1 (d u — A) with boundary 
conditions u>(0) = 0, u/(l) = Aoj(1) on TL- From [14], Theorem 3.3, we know 
that the stationary distribution of the W-valued SDE (3.1) coincides with v. 
By taking expectations on both sides of [14], equation (3.10) in the stationary 
state, we find that m solves the boundary value problem 

{8 U + A*){BB*y\d u - A)m{u) =0 Vu 6(0,1), 

m(0) = x~ , m'(l) = Ara(l). 

Define U : E -> R by U(u) = log(^(cj)) for all uj G E. We then have du = 
exp(U (X)) dv and the Frechet derivative F = U' is given by 

(F(u - m)){u) = -W(w(l))i(l - u) 

(6.3) 

+ D 2 V{uo{u))Auj(u) + A*VV(uj(u)) - V*(w(u)) 

for all w G + m, where 5i G E* is a Dirac mass at u = 1 and L> 2 F denotes 
the Hessian of V. 

We check that the conditions of Theorem 3.4 are satisfied: from [14], 
Theorem 3.3, we know that jV(0, — C" 1 ) is the distribution of Z — m. The 
only nontrivial point to be verified in assumption (Al) is the fact that C 
generates a contraction semigroup on E. This, however, follows immediately 
from the maximum principle. It follows from standard Sobolev embeddings 
that TL a C E densely for every a > 1/4 and [2], Lemma A.l implies that 
A/"(0, —C~ 2a ) is concentrated on E in this case, so assumption (A2) also 
holds. Assumption (A3) is an immediate consequence of condition (M). 

In order to check assumption (A4), define, for n > 1, the function 5 n (u) = 
n X\o,i/n]i where \A denotes the characteristic function of a set A. With this 
definition at hand, we define F n :E^E by 

{F n (u - m))(u) = -VV(u(u))5 n (l - u) 

+ D 2 V{uj{u))Auj{u) + A*VV(lo{u)) - V$(w(u)) 

= :(F n °(o;))(n) + F 1 M W )). 

Since 7i a is contained in some space of Holder continuous functions (by 
Sobolev embedding), (u) — F(u)\\- a = for every uj G 

E. The locally uniform bounds on the F n as functions from E to 7i~ a follow 
immediately from condition (M), so it only remains to check the dissipativity 
bound (2.7). 

We first use the representation (2.8) of the sub differential in E to check 
the condition (oj*,F^(lo + y)) < provided that \\uj\\e is greater than some 
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polynomially growing function of It follows from condition (M) and 

Holder's inequality that there exists an increasing function G : R — > M+ grow- 
ing polynomially with y such that 

- [\cj(u),VV(u + y))6 n (l-u)\cj*\(du) 
Jo 

< - [ (M(w(u), . . .,«(«)) - G(\y(u)\))5 n (l - u)\u*\(du) 

Jo 

< - f 1 (c\u(u)\ 2 p - G(\\y\\ E ))5 n (l - u)\u*\(du) 

Jo 

= -(cIMlt - G(\\y\\)) f 1 5 n (l - u)\<j*\(du), 
Jo 

which is negative for ||o;||.e sufficiently large. In order to check the corre- 
sponding condition for F , we treat the cases p = 1 and p > 1 separately, 
where p is the exponent from condition (M). 

In the case p = 1, we can write V(x) = \ {x, Qx) + V(x) for some positive 
definite matrix Q. We then have 

F 1 (x) = QAx + A*Qx - QBB*Qx + F 1 (x), 

where F 1 has sublinear growth at infinity. Condition (Q) then implies that 
there exists a constant 7 > such that 

(x,F 1 (x)) < -7|x| 2 + IxUF^x)!, 

so that (2.7) follows from 5.4. 

In the case p > 1, it follows from condition (M) that 

(x,F 1 (x)) = -J2M(x,...,x,Be l ) 2 + F 1 (x), 

i 

where F 1 behaves like o(|x| 4p_2 ) at infinity. The nondegeneracy of M thus 
implies that there exist constants 7 > and C such that 

(x,F\x)) = - 7 \x\ 2 + C, 

so that (2.7) again follows from 5.4. 

We finally check that U is bounded from above. In the case p > 1, this 
follows easily from condition (M). In any case, V is bounded from below, so 
that in the case p = l, we have 

U(uj + m)<C+ [\(Mu){u),Auj{u)) -\B*Muj{u)\ 2 )du+ C G{u(u))du 
Jo Jo 

for some function G behaving like o(|x| 2 ) at infinity. It thus follows from 
condition (Q) that U is indeed bounded from above. This concludes the 
verification of the assumptions of Theorem 3.4. 
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We now check that the assumptions of 4.5 hold. The only fact that remains 
to be checked is that (4.8) holds in our case. This can be verified easily since 
the nonlinearity is of the form 



so it suffices to multiply the functions Gi by smooth cut-off functions in 
order to get the required approximations of F. 

From Theorem 2.6, we get that SDE (2.1) has a unique, mild solution for 
every initial condition xo G E. Corollary 4.5 shows that the unique, ergodic 
invariant measure of SDE (2.1) is \i. Converting from a Hilbert-space- valued 
SDE to an SPDE, as outlined in Remark 5.5, we find equation (8.3). This 
completes the proof of statement (a). Statement (b) follows directly from 
Theorem 4.10. □ 

Remark 6.2. If (BB*)~ 1 A is symmetric, the matrix A can be incor- 
porated into the potential V by choosing A = and replacing V(x) with 
V(x) - \(x, (BB^Ax). In this case, the SPDE (6.1) simplifies to the more 
manageable expression 



d t x(t, u) = {BB*)- l dlx{t, u) - V$(x(t, u)) + V2d t w(t, u) 

V(t,u) G (0,oo) x [0,1], 
x(t,0)=x~, d u x(t,l) = f(x(t,l)) Vie(0,oo), 



Similar simplifications are possible for the SPDEs considered in the remain- 
der of this section and in the next. 

Using the preconditioning technique described above, we can construct 
modified versions of the SPDE (6.1) which still have the same stationary 
distribution. In the preconditioned SDE (2.14), we take Q = —C^ 1 , where 
C is the self-adjoint version of (d u + A*)(BB*)~ 1 (d u — A) with boundary 
conditions w(0) = 0, u/(l) = Au(l) on L 2 . 

Theorem 6.3. Let A e R dxd be a matrix, let B G R dxd be invertible, 
f = —BB*W, assume that V satisfies conditions (M) and (Q) and let 
x~ G R d . Denote by L the differential operator (d u + A*){BB*)~ l (d u — A) 
and consider the M. d -valued SPDE 

(6.4a) dtx(t,u) = —x(t,u) + y(t,u) + V2dtw(t,u), x(0, u) = xq(u), 

where w is a Q -Wiener process, $ is given by (5.3) and y(t, •) is the solution 
of the elliptic problem 



(F(v))(u) = Gi(u(u)) + G 2 Ml))(f(l - u) 



x(0, u) 



x (u) 



Vug [0,1]. 



(6.5) 



Ly(t, u) = V$(z(*, u)) + A*{BB*)~ l f{x{t, u)) 
+ ( J D/)*(x(t,n))( J BS*)- 1 Ax(t,u) 
y(t, 0) = x~, 8 u y(t, 1) = Ay(t, 1) + f(x(t, 1)). 
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(a) This SPDE has a unique, strong solution for every xq £ C_([0, ) 
and its stationary distribution coincides with the distribution of the solution 
ofSDE (5.1). 

(b) For every bounded function <p:C- ([0,1], 1R ) — > M with bounded Frechet 
derivative and every xq G C_([0, l],M d ) ; we have 



where X is the solution of (5.1). 

Proof. Choose 7i, E, C, m, \x and U as in the proof of Theorem 6.1. 
From [14], Theorem 3.3, we know that Q is the covariance operator of the 
law of the solution of (5.2) and thus is positive definite, self-adjoint and 
trace class. We have already checked that (Al)-(A3) hold in the proof of 
Theorem 6.1. Furthermore, (A6)-(A7) are trivially satisfied for our choice 
of Q, so it only remains to check (A5) in order to apply Theorem 2.10. Note 
that the nonlinearity F is of the form 



for some functions Fi : W 1 — s> M n . It follows from condition (M) that there 
exist constants C and iV such that both of these functions satisfy 



for every x and y in W 1 . The validity of (A5) follows immediately. 

Applying Theorem 2.10, we obtain that SDE (2.14) has a unique, strong 
solution for every initial condition xq G E. Corollary 4.9 shows that the 
unique, ergodic invariant measure of SDE (2.1) is /i. Converting from a 
Hilbert-space-valued SDE to an SPDE, as outlined in Remark 5.6, we find 
equation (6.1). This completes the proof of statement (a). Statement (b) 
follows directly from Theorem 4.11. □ 

7. Bridge path sampling. In this section, we construct SPDEs which 
sample, in their stationary state, bridges from the SDE (5.1). That is, the 
stationary distribution of the SPDE coincides with the distribution of solu- 
tions of the SDE (5.1), conditioned on X(l) = x + . The main results appear 
in Theorems 7.1 and 7.2. 

Note that, for consistency with the other results in this paper, we con- 
struct an E- valued SPDE theory. However, this functional framework is not 
actually needed for this problem because the boundary conditions are linear; 
it is indeed possible in this case to use a Hilbert space theory. In that func- 
tional setting, results analogous to those in this section are mostly contained 
in [25] and [4]. We also refer to the monographs [5, 6] for related results. 




in probability. 



(F(x))(u) = F 1 (x(u)) + F 2 (x(l))5(l - u) 



(x,F i {x + y))<C{l + \y\) 



N 
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Finally, note that the SPDE (7.1) was also derived (in the one-dimensional 
case) in [20]. 

We write C^([0, l],M rf ) for the set of all continuous functions from [0,1] 



Theorem 7.1. Let A G R dxd be a matrix, let B G R dxd be invertible, 
f = —BB*W, assume that V satisfies conditions (M) and (Q) and let 
x~,x+ G R d . Consider the R d -valued SPDE 



(7.1b) x(t,0)=x~, x(t,i)=x + , 
(7.1c) x(0,u) = xq(u), 

where dtw is space-time white noise and $ is given by (5.3). 

(a) This SPDE has a unique, mild solution for every xq G CjT([0, l],R d ) 
and its stationary distribution coincides with the distribution of the solution 
of SDE (5.1), conditioned on X(l) = x + . 

(b) For every bounded, measurable function ip : Cj([0, —>R and ev- 
ery xq G Ct ( [0, 1] , R d ) , we have 

1 f T 

lim — / ip(x(t, •)) dt = K(ip(X)\X(l) = x + ) almost surely, 
T^oo T Jo 

where X is the solution of (5.1). 

Proof. Let X and Z be the solutions of the SDEs (5.1) and (5.2), 
respectively. From Lemma 5.2, we know that the density of the distribution 
X with respect to the distribution of Z is given by (6.2). Let £(Z\Z(1) = 
x + ) denote the conditional distribution of Z and let m: [0, 1] — > R d be the 
mean of this distribution. Using Lemma 5.3 and substitution, the density of 
fj, = C(X\X(1) = x + ) — m w.r.t. the centered distribution v = £(Z\Z(1) = 
x + ) — m is then given by 



for all oj G C_([0, l],]R rf ) and some normalization constant c. 

Consider the Hilbert space H = L 2 ([0, l],lR d ) and the embedded Banach 
space £ = {u£ C([0, l],]R d )|u;(0) = cj(1) = 0} equipped with the supremum 
norm. Define the operator £ on Ti to be the self-adjoint version of {d u + 
A*)(BB*)~ x (d u - A) with boundary conditions w(0) =u(l) = 0. From [14], 
Theorem 3.6, we know that the stationary distribution of the "H-valued SDE 



to R d which run from x to x 



(7.1a) 



d t x = (8 U + A*){BB*r\d u - A)x - V$(x) 

- (Df)*(x)(BB*)~ 1 Ax - A*(BB*y 1 f(x) + V2d t w, 



ip(co — m) = c • exp 
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(3.1) coincides with v. By taking expectations on both sides of [14], equation 
(3.11) in the stationary state, we find that m solves the boundary value 
problem 

{8 U + A*){BB*)-\d u - A)m{u) =0 VuG (0,1), 

m(0)=x~, m(l) = x + . 

Define U : E —* E by U{u) = \og(ip{uj)) for all uj G E. We then have d\i = 
exp(U(u>)) du and the Frechet derivative F = U' is given by 

(7.2) F(u -m) = D 2 V(uj{u))Au(u) + A*VV{u{u)) - V$(w(u)) 
for all w — m G -B. 

Since (7.2) is the same as (6.3) without the terms involving delta functions, 
we can check that (Al)-(A4) hold in exactly the same way as in the proof 
of Theorem 6.1. From Theorem 2.6, we obtain that SDE (2.1) has a unique, 
mild solution for every initial condition xq G E. Corollary 4.5 shows that 
the unique, ergodic invariant measure of SDE (2.1) is \i. Converting from a 
Hilbert-space-valued SDE to an SPDE, as outlined in Remark 5.5, we find 
equation (8.3). This completes the proof of statement (a). Statement (b) 
follows directly from Theorem 4.10. □ 

Again, we study the corresponding result which is obtained from the pre- 
conditioned SDE (2.14). Since it is, in general, easier to invert the Laplacian 
with Dirichlet boundary conditions rather than C, we choose Q = —Cq 1 , 
where Co is the self-adjoint version of (BB*)~ 1 d 2 with boundary conditions 
w(0) = w(l) = on L 2 . This procedure leads to the following result. 

Theorem 7.2. Let A G R dxd be a matrix, let B G R dxd be invertible, 
f = —BB*W, assume that V satisfies conditions (M) and (Q) and let 
x~,x+ G R d . Consider the R d -valued SPDE 

(7.3a) dtx(t,u) = —x(t,u) + y(t,u) + V2dtw(t,u), x(0, u) = xo(u), 
where w is a Q -Wiener process and y(t,-) is the solution of 



and its stationary distribution coincides with the conditional distribution of 
the solution of SDE (5.1), conditioned on X{\) = x + . 



(BB^dly 



(BB*)~ 1 Ad u x - A*(BB*Y l d u x 

+ A*(BB*)- l Ax + V<$>{x) 

+ (Df)*(x)(BB*)- 1 Ax + A*(BB*y l f{x), 



(7.4) 
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(b) For every bounded function ip:Ct{[0, l],M. d ) — > K with bounded Frechet 
derivative and every xo G Clt ([0, 1],R ), we have 

1 f T 

lim — / <p(x(t, •)) dt = K(tp(X)\X(l) = x + ) in probability, 
T^oo T Jo 

where X is the solution of (5.1). 

Proof. The proof works in almost the same way as the proof of Theo- 
rem 6.3. The primary difference is that, in the present case, the operator Q 
is not the inverse of —C, but only of its leading order part. □ 

8. Nonlinear filter /smoother. Consider the M. d x M m -valued system of 
stochastic differential equations 

dX = f(X)du + B u dW x , X(0)~C, 

(8.1) 

dY = A 21 Xdu + B 22 dW y , Y(0) = 0, 

where B u G M. dxd , A 21 G M. mxd and B 22 G M mxm are matrices, (Sn^)" 1 / 
is a gradient and W x (resp. W y ) is an independent standard Brownian mo- 
tion in W 1 (resp. M m ). We will construct an SPDE which has the conditional 
distribution of X given Y as its stationary distribution. £ is the density of the 
initial distribution for X in (8.1). The main results are stated in Theorems 
8.2 and 8.4. 

Remark 8.1. It is straightforward to extend the contents of this section 
to more general systems of the form 

dX = {A U X + BuB^VV^X)) du + B n dW x , 

dY = (A 21 X + A 22 Y + B 22 B* 22 VV 2 {Y)) du + B 22 dW y 

instead of equation (8.1). (We include the AnX-term in the two previous 
sections.) We do not do so here because it would clutter the presentation. 

Theorem 8.2. Let A 2l G M. mxd , Bu G R dxd and B 22 G W mxm and as- 
sume that Bu and B 22 are invertible. Let f = — BuB*iVV and assume that 
V satisfies conditions (M) and (Q). Let £ be a C 2 probability density such 
that a = e v C, satisfies 

(8.2) max{loga(x), |(Vloga(x),x)} < — e|x| 2 

whenever \x\ > c for some constants e,c > 0. Consider the W 1 -valued SPDE 

d t x(t,u) = {BuBl^dlx^u) - V$(x(t,u)) 

(8.3a) 

+ A^B^B*^)- 1 \-^{u) - A 21 x(t, u)j + V2d t w(t,u) 
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for all (t,u) G (0, oo) x [0,1] with boundary conditions 

(8.3b) S tt x(t,0) = -fliiJ3j 1 VIoga(x(t,0)) > d u x(t, 1) = f(x(t, 1)) 

/or all t G (0, oo) and initial condition 

(8.3c) x(0, u) = xo(u) 

/or aZZ it G [0, 1], where dtw is space-time white noise and <3? is given by (5.3). 

(a) This SPDE has a unique, mild solution for every xq G C([0, l],M rf ) and 
its stationary distribution coincides with the conditional distribution [jlx\y 
of X given Y where X,Y solve (8.1). 

(b) For every bounded, measurable function if : C([0, 1], M rf ) — > M and every 
x Q eC([0,l],l d ), we We 

1 /- T 

lim — / <p{x(t, •)) (if = E(y>(X)|y) almost surely, 
T->oo T Jo 

where X,Y solve (8.1). 

Remark 8.3. The condition (8.2) on a seems to be quite artificial. On 
the other hand, if no a priori information is given on the distribution of X(0) , 
it is natural to assume that X(0) is given by the invariant measure, in which 
case log a = — V, so that the assumptions on a are satisfied. In this case, the 
boundary conditions (8.3b) reduce to the more symmetric expression 

d u x(t, 0) = -f(x(t, 0)),d u x(t, 1) = f(x(t, 1)). 

Proof of Theorem 8.2. Define V : R d x R m -> R by V(x,y) = V{x). 
We can then write the system (X, Y) from (8.1) as an SDE of the form 

(8.4) =A (y) du ~ BB *^V{X,Y) du + Bd f^J 
with 

Let (X,F) be the solution of the linear, R d x Revalued SDE 

(8.5) d(^=A(^dt + Bd(^ y } 

with initial conditions 

X(0) ^jV^e -1 ), F(0) = 0. 

We can use Lemma 5.2 to obtain the density of the distribution [j,xy of 
(X, Y) with respect to the distribution [Ixy °f C^j Since the nonlinearity 
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(/(X),0) is orthogonal to the range of B in R x R m , the resulting density 
is 

(p(u},rj) = exp(v(u(0)) - V(w(l)) - j §(u;{u)) duj 0(w(O)) 

for all (u,rj) G C([0,l],M d x R m ). Here, 0(-) is the density of the distribu- 
tion of X(0), relative to the Gaussian measure AA(0,e _1 ). This density is 
proportional to exp(— V(x) +loga(x) + ^e|x| 2 ). 

From [14], Lemma 4.4, we know that the conditional distribution /j, x \y °f 
X given Y exists and Lemma 5.3 shows that, since <p does not depend on 
Y, fix\Y nas density (p with respect to ^x\Y- Let m be the mean of Hx\y- 
The density tp of fj, = Hx\Y ~ m w.r.t. v = (J- x \y — 771 is then given by 

tp(cj - m) oc exp^loga(cj(0)) + ||w(0)| 2 - V(w(l)) - J <$>(uj{u))du^j 

for all weC([0,l],R d ). 

Consider the Hilbert space H = L 2 ([0, and the embedded Banach 

space E = C([0, l],IR d ) C H equipped with the supremum norm. Define the 
formal second order differential operator 

L = (BuBl^dlx - Al 1 (B 22 B* 2 y l A 21 x. 

Define the operator £ to be the self-adjoint version of L on TC with bound- 
ary conditions u/(0) = eBnB^u^) and u/(l) = 0. From [14], Theorem 4.1, 
we know that the stationary distribution of the "H-valued SDE (3.1) coin- 
cides with v. By taking expectations on both sides of [14], equation (4.2) 
in the stationary state, we find that m solves the boundary value problem 
—Lm(u) = ^2i(^22^22)~ 1 7ri"( u ) f° r an u £ (0; 1) with boundary conditions 
m'(0) = e-B u 5f 1 m(0) and m'(l) = 0. 

Define [7 : — > R by f7(u>) = log(V'(w)) for all uj £ E. We then have d// = 
exp(C/(w)) o!ia The Frechet derivative F = U' is given by 

F(w - m) = loga(w(0))do + ew(0)<5 - VV(w(l))$i - V*(w(u)) 

for all ui £ E, where #0j^i £ F* are delta distributions located at 0, 1, re- 
spectively. 

At this point, we are back in a situation that is very close to the one of 
Theorem 6.1 and we can check that (Al)-(A4) are satisfied. Note that (8.2) 
ensures that U is bounded from above and that the term loga(X(0))<5o ap- 
pearing in F satisfies (2.7). The various statements now follow from Theorem 
2.6 and Corollary 4.5 as in Theorem 6.1. □ 

In the preconditioned version of this theorem, we take Q = — £q , where 
Co is the self-adjoint extension of (BnBl 1 )~ 1 du with boundary conditions 
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o/(0) = and u/(l) = sBuB^ioil) for an e chosen so that (8.2) holds. This 
yields the following result, in which it is important to note that w depends 
upon e. 

Theorem 8.4. Assume that the conditions of Theorem 8.2 hold and 
consider the M. d -valued evolution equation 

dtx(t, u) = —x(t, u) + y(t, u) + V2dtw(t, u), x(0, u) = xo(u), 

where w is a Q -Wiener process and y(t,-) is the solution of the problem 

(BnBl^dly = A^B^B^)' 1 U 21 x - ^) + V*(s), 

with boundary conditions 

d u y(t,0) = £B u B* n (y(t,0) - x(t,0)) - B U B* U V log a(x(t,0)), 
d u y(t,l) = f(x(t,l)). 

As usual, 3> is given by (5.3). 

(a) This SPDE has a unique, strong solution for every xq 6 C([0, 1], M d ) 
and its stationary distribution coincides with the conditional distribution 
^x\Y °f X given Y, where X,Y solve (8.1). 

(b) For every bounded function tp : C([0, 1], W 1 ) — > E with bounded Frechet 
derivative and every xq G C([0, l],R d ), we have 



1 



T 



lim — / <f(x(t, •)) dt = K(tf(X)\Y) in probability, 

T^oo T Jo 

where X,Y solve (8.1). 

Proof. The proof is very similar to that of Theorem 6.3, so we omit it. 

□ 

9. Conclusions. In this paper we derived a method to construct nonlinear 
SPDEs which have a prescribed measure as their stationary distribution. 
The fundamental relation between the drift of the SPDE and the density 
of the stationary distribution is in analogy to the finite dimensional case: 
if we augment the linear SPDE by adding an extra drift term of the form 
F = U' , the stationary distribution of the new SPDE has density exp(U) 
with respect to the stationary distribution of the linear equation. 

Since the resulting SPDEs have unique invariant measures and are er- 
godic, they can be used as the basis for infinite dimensional MCMC methods. 
The applications in Sections 6, 7 and 8 illustrate this approach to sampling 
by constructing SPDEs which, in their stationary states, sample from the 
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distributions of finite dimensional SDEs, conditioned on various types of ob- 
servations. However, our analysis is limited to problems for which the drift 
is linear plus a gradient. Furthermore, in the case of signal processing, the 
analysis is limited to the case where the dependency of the observation on 
the signal is linear. 

We have clear conjectures about how to generate the desired SPDEs in 
general, which we now outline. We start by considering the first two condi- 
tioning problems, 1 and 2. Since we consider the general nongradient case, 
the linear term can be absorbed into the nonlinearity and we consider the 
SDE 

(9.1) dX = f{X)du + BdW x , X(0) = x~. 

In the physics literature, it is common to think of the Gaussian measure 
induced by this equation when / = as having density proportional to 

dZ 2 



qim(Z) =exp^-^ i 



du 



du 

B 



If we denote by 5 the variational derivative of path-space functionals such 
as this and consider the SPDE 

- = Sln qVm (z) + V2— 

(the last term being space-time white noise), then this will sample from 
Wiener measure or Brownian bridge measure, depending on which bound- 
ary conditions are applied. This is an infinite-dimensional version of the 
Langevin equation commonly used in finite-dimensional sampling. General 
linear SPDEs derived similarly are proved to have the desired sampling 
properties in [14]. 

One can use the formal density q given above, in combination with Lemma 
5.2, to derive a formal density on path space for (9.1), proportional to 



<?nonPO =exp^-^ i 



dX .2 1 



du 



+ -divf(X)du 

B z 



This density also appears in the physics literature and is known as the 
Onsager-Machlup functional [11]. The SPDEs, which we derived in Sections 
6 and 7, may be found by considering SPDEs of the form 

dx . r-dW 

— = dlnq non (x) + V2— 

(the last term again being space-time white noise). Again, this may be seen 
as a Langevin equation. Calculating the variational derivative, we see that 
this SPDE has the form 



50 



M. HAIRER, A. M. STUART AND J. VOSS 



where 

G(x) = {BB*)~ 1 Df{x) - Df{ X y{BB*)-\ 

For bridge path sampling, the boundary conditions are those dictated by the 
bridging property. For free path sampling, the variational derivative includes 
a contribution from varying the right-hand end point, which is free, giving 
rise to a delta function; this leads to the nonlinear boundary condition. 

When / has a gradient structure, the operator 0(x) =0 and the SPDE is 
analyzed in this paper (in the case A = 0). When G(x) 7^ 0, it will, in general, 
be necessary to define a new solution concept for the SPDE in order to make 
sense of the product of 0(x) with the derivative of x; in essence, we must 
define a spatial stochastic integral, when the heat semigroup is applied to 
this product term. The Stratonovich formulation of the density q non suggests 
that some form of Stratonovich integral is required. The case where the 
nongradient part of the vector field is linear is considered in this paper, 
and the provably correct SPDE in that case coincides with the conjectured 
SPDE above. 

Turning now to the case of signal processing, we generalize the observation 
equation (1.2) to 

dY = g(X,Y)dt + BdW y , Y(0)=0. 

We can derive the appropriate SPDE for sampling in this case by utilizing 
the Onsager-Machlup functional above and applying Bayes' rule. Define 



q y (X,Y) = exp(-J Q 



1 1 
2 



du 



B 



The Onsager-Machlup density for sampling (X,Y) jointly is 

q(X,Y):=q non (X)q y (X,Y). 

By Bayes' rule, the conditioned density for X\Y will be proportional to 
q(X,Y), with proportionality constant independent of X. Thus, the SPDE 
for sampling in this case should be 

-=6lnq( X ,Y) + V2 — 

(the last term again being space-time white noise and Y being the given 
observation). In the case where g(X, Y) depends only on X and is linear, and 
when f(X) has the form considered in this paper, this SPDE is exactly that 
derived in this paper. Outside this regime, we are again confronted with the 
necessity of defining a new solution concept for the SPDE and, in particular, 
deriving a spatial stochastic integration theory. A related SPDE can also 
be derived when the observations are discrete in time. In this case, delta 
functions are introduced into the SPDE at the observation times; the theory 
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introduced in this paper is able to handle this since a similar issue arises for 
the nonlinear boundary conditions considered here. Langevin SPDEs which 
solve the signal processing problem are discussed in [13]. In that paper, 
numerical experiments are presented which indicate that the conjectured 
SPDEs are indeed correct. 

Finally, let us remark that we are currently unable to treat the case of 
multiplicative noise. We do not believe that this is a fundamental limitation 
of the method, but interpreting the resulting SPDEs will require much more 
careful analysis. 

In addition to the extension of the Langevin equation to nongradient 
problems and more general observation equation, there are many other inter- 
esting mathematical questions remaining. These include the study of second 
order (in time t) Langevin equations, the development of infinite-dimensional 
hybrid Monte Carlo methods, the study of conditional sampling for hypo- 
elliptic diffusions and the derivation of sampling processes for nonadditive 
noise. 
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